Preface


During the last ten or fifteen years, much progress has been made 
in the theory of non-self-adjoint compact linear operators on Hilbert 
space. The present volume is intended to provide a brief (and far 
from exhaustive) introduction to the subject. For the greater part, 
it is a revised version of a graduate course given at the University 
of Pennsylvania during the academic year 1964-65; some further 
material has been added subsequently, notably in sections 4.4 and 
4.5. 

A numeral in square brackets, in the text, refers to the 
bibliography included at the end of the book. Cross references to 
theorems, the principal definitions, etc., are given in decimal 
notation; § 3.2 is the second section of Chapter 3, and Lemma 3.2.6
is the sixth numbered item in that section. Equations are numbered
consecutively, starting from (1), throughout each section; the
twenty-third equation in § 3.2 is referred to as (23) within that
section, but as 3.2(23) in subsequent sections. 

I am indebted to the Trustees of the University of Pennsylvania, 
and to Professors Oscar Goldman and Richard Kadison, for arranging 
a visiting appointment in the Mathematics Department during the 
academic year 1964-65, when the lecture course which forms the 
basis of this volume was given. On two counts, my thanks are due 
to Dr Frank Smithies; first because, as my research supervisor
during the middle 50s, he aroused my interest in many of the topics

in place of the more cumbersome
$\{b \in B : \text{there exists } a \text{ in } A \text{ such that } P(a) \text{ and } f(a) = b\}$.


Occasionally, it is convenient to refer to f as 'the mapping $a \to f(a)$'.

Suppose that K is either the real field or the complex field, and
that X and Y are normed spaces over K; the norm in both spaces will
be denoted by $\|\ \|$. We recall that a linear operator T from X into Y is
continuous if and only if it is bounded, in the sense that the set
$\{\|Tx\| : x \in X, \|x\| \le 1\}$ of real numbers is bounded above. The
set $\mathcal{B}(X, Y)$ of all bounded linear operators from X into Y is a vector
space over K, and has a norm defined by

(1) $\|T\| = \sup\{\|Tx\| : x \in X, \|x\| \le 1\}$

(equivalently, $\|T\|$ is the smallest real number C such that
$\|Tx\| \le C\|x\|$ for each x in X). If Y is a Banach space, then so is
$\mathcal{B}(X, Y)$ [59: p. 163].

An element T of $\mathcal{B}(X, Y)$ is said to be isometric if $\|Tx\| = \|x\|$
for each x in X. If there is an isometric linear operator U from X
onto Y, then X and Y are said to be isometrically isomorphic, and U 
is described as an isometric isomorphism from X onto Y. 

A linear subspace M of a normed space X is itself a normed
space. If M is a closed subspace, then the set X/M of all cosets
$[x] = x + M$ of M in X is a normed space when the algebraic operations
and norm are defined by

$\alpha[x] = [\alpha x], \qquad [x] + [y] = [x + y],$

$\|[x]\| = \inf\{\|z\| : z \in [x]\} = \inf\{\|x - y\| : y \in M\}.$

We call X/M the quotient space of X modulo M. The mapping
$x \to [x]$ is called the canonical mapping from X onto X/M, and is a
norm decreasing (hence bounded) linear operator. If X is a Banach 
space, then so is X/M [59: pp. 104, 105]. 

If M and N are subspaces of a vector space X, we denote by
$M + N$ the subspace $\{x + y : x \in M, y \in N\}$ of X.

THEOREM 1.1.1. Suppose that M is a closed subspace of a
normed space X and N is a finite-dimensional subspace of X. Then
$M + N$ is a closed subspace of X.

Proof. The canonical mapping $\phi$ from X onto the quotient
space X/M is continuous. Since $M + N$ is the inverse image under $\phi$
of the finite-dimensional subspace $\phi(N)$ of X/M, it suffices to show
that $\phi(N)$ is closed in X/M. For this, we refer to [59: p. 96].


If X is a normed space over K, then a linear operator from X into
K is called a linear functional on X. Since K is a Banach space, it
follows that the set $\mathcal{B}(X, K)$ of all bounded linear functionals on X
is a Banach space (whether or not X is complete), with norm defined by

$\|f\| = \sup\{|f(x)| : x \in X, \|x\| \le 1\}.$

We call $\mathcal{B}(X, K)$ the dual space of X, and denote it by $X^*$.

We assume that the reader is familiar with the Hahn-Banach
theorem, and its immediate consequences, concerning the existence
and properties of bounded linear functionals (see, for example,
[59: pp. 185-188]). We recall one result of this type. Suppose that
X is a normed space, $A \subseteq X$ and $B \subseteq X^*$. We can define closed
subspaces $A^0$ of $X^*$ and $B_0$ of X by the equations

(2) $A^0 = \{f \in X^* : f(y) = 0 \ (y \in A)\},$

(3) $B_0 = \{x \in X : f(x) = 0 \ (f \in B)\}.$

It is clear that $A \subseteq (A^0)_0$; and it follows easily, from the
Hahn-Banach theorem, that $M = (M^0)_0$ when M is a closed subspace
of X [59: Theorem 4.3-D, p. 186].


Given an element x of a normed space X, we can define a bounded
linear functional $\hat{x}$ on the dual space $X^*$ by the equation

$\hat{x}(f) = f(x) \qquad (f \in X^*).$

The mapping $x \to \hat{x}$ is an isometric isomorphism from X onto a
subspace $X_0$ of the second dual space $X^{**}$ $(= (X^*)^*)$. We call $X_0$
the canonical image of X in $X^{**}$, and say that X is reflexive if
$X_0 = X^{**}$ [59: p. 191].
If X and Y are normed spaces over K and B is a subset of
$\mathcal{B}(X, Y)$ such that

$\sup\{\|T\| : T \in B\} < \infty,$

then it follows trivially, from the inequality $|f(Tx)| \le \|f\| \, \|T\| \, \|x\|$,
that

$\sup\{|f(Tx)| : T \in B\} < \infty$

whenever $f \in Y^*$ and $x \in X$. For Banach spaces, there is a converse
result, the principle of uniform boundedness.


THEOREM 1.1.2. Suppose that X and Y are Banach spaces
over K and $B \subseteq \mathcal{B}(X, Y)$. If, for each f in $Y^*$ and x in X,

$\sup\{|f(Tx)| : T \in B\} < \infty,$

then

$\sup\{\|T\| : T \in B\} < \infty.$

Proof. We content ourselves with showing that this theorem
follows at once from very similar results proved, for example, in
[59: pp. 201-204].

For a fixed element x of X,

$\sup\{|f(Tx)| : T \in B\} < \infty$

for each f in $Y^*$. By applying [59: Theorem 4.4-A, p. 202] to the
subset $F = \{Tx : T \in B\}$ of Y, it follows that

$\sup\{\|Tx\| : T \in B\} < \infty.$

This holds for each x in X so, by [59: Theorem 4.4-E, p. 204],

$\sup\{\|T\| : T \in B\} < \infty.$


1.2. Unordered summation 

Let f be a mapping from a set A into a (real or complex) Banach
space X. We say that f vanishes at infinity, and write $f(a) \to 0$ as
$a \to \infty$, if the following condition is satisfied: given any positive $\epsilon$,
the set $\{a \in A : \|f(a)\| \ge \epsilon\}$ is finite. When this is so, the set
$\{a \in A : f(a) \ne 0\}$ is finite or countable, since it is the union of the
finite sets $\{a \in A : \|f(a)\| \ge 1/n\}$ (n = 1, 2, ...).

We introduce the idea of unordered summation in terms of the
convergence of nets, following the terminology of [33: p. 65 et seq.].
The class $\mathcal{F}$ of all finite subsets of A is directed by the inclusion
relation $\supseteq$, and we can define a mapping s from $\mathcal{F}$ into X by the
equation

(1) $s(F) = \sum_{a \in F} f(a) \qquad (F \in \mathcal{F}).$

We say that f is summable if the net $(s, \supseteq)$ converges to some
element x of X; when this is so, we write

$\sum_{a \in A} f(a) = x.$

Thus f is summable if and only if there is an element x of X with
the following property: given any positive $\epsilon$, there exists $F(\epsilon)$ in
$\mathcal{F}$ such that

(2) $\left\| x - \sum_{a \in F} f(a) \right\| < \epsilon$ whenever $F \in \mathcal{F}$ and $F \supseteq F(\epsilon)$.


We sometimes express the fact that f is summable by saying that
the sum $\sum_{a \in A} f(a)$ exists. Of course, if the set
$B = \{a \in A : f(a) \ne 0\}$ is finite, then f is summable and x is the
usual algebraic sum $\sum_{a \in B} f(a)$; for (2) is then satisfied if we take
$F(\epsilon)$ to be B, for each positive $\epsilon$.

Many of the expected manipulative properties of unordered sums
are readily verified, and will be used in future without comment; for
example, the reader should have no difficulty in supplying the
missing proof of the following lemma.


LEMMA 1.2.1. Suppose that f and g are mappings from a set
A into a Banach space X, $\alpha$ and $\beta$ are scalars, and T is a bounded
linear operator from X into another Banach space Y.

(i) If f and g are summable, then so is $\alpha f + \beta g$, and

$\sum_{a \in A} [\alpha f(a) + \beta g(a)] = \alpha \sum_{a \in A} f(a) + \beta \sum_{a \in A} g(a).$

(ii) If f is summable, then the function $T \circ f$, defined by
$(T \circ f)(a) = T(f(a))$, is summable, and

$\sum_{a \in A} T(f(a)) = T\left( \sum_{a \in A} f(a) \right).$

LEMMA 1.2.2. Let f be a mapping from a set A into a Banach
space X, and let $\mathcal{F}$ denote the class of all finite subsets of A.
Then f is summable if and only if it satisfies the following Cauchy
condition: given any positive $\epsilon$, there exists $G(\epsilon)$ in $\mathcal{F}$ such that

(3) $\left\| \sum_{a \in F} f(a) \right\| < \epsilon$ whenever $F \in \mathcal{F}$ and $F \subseteq A \setminus G(\epsilon)$.


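Proof. Suppose that f is summable and, for each positive $\epsilon$,
choose $F(\epsilon)$ in $\mathcal{F}$ so that (2) is satisfied. If $F \in \mathcal{F}$ and
$F \subseteq A \setminus F(\tfrac{1}{2}\epsilon)$, then both the sums $\sum_{a \in F(\frac{1}{2}\epsilon)} f(a)$ and
$\sum_{a \in F \cup F(\frac{1}{2}\epsilon)} f(a)$ approximate x within $\tfrac{1}{2}\epsilon$; hence their difference,
$\sum_{a \in F} f(a)$, has norm less than $\epsilon$. This shows that the Cauchy
condition (3) is satisfied, with $G(\epsilon) = F(\tfrac{1}{2}\epsilon)$.

Conversely, suppose that f has the Cauchy property stated in
the lemma. If $F_j \in \mathcal{F}$ and $F_j \supseteq G(\tfrac{1}{2}\epsilon)$ (j = 1, 2) then (with the
notation of (1))

$\left\| s(F_j) - s(G(\tfrac{1}{2}\epsilon)) \right\| = \left\| \sum_{a \in F_j \setminus G(\frac{1}{2}\epsilon)} f(a) \right\| < \tfrac{1}{2}\epsilon,$

and so $\|s(F_1) - s(F_2)\| < \epsilon$. This shows that $(s, \supseteq)$ is a Cauchy net;
since the space X is complete, $(s, \supseteq)$ converges, and f is summable.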
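
The Cauchy condition (3) can be checked by hand in simple cases. The following Python sketch is an illustration only and is not taken from the text: the index set A = {1, ..., 2000}, the function f(a) = (-1)^a / a^2, and the names `f`, `s`, `G`, `outside` and `worst` are all assumptions made for the example. It relies on the elementary bound $\sum_{a > N} a^{-2} < 1/N$, which suggests taking G(ε) = {1, ..., 50} when ε = 1/50, and then samples finite sets F disjoint from G(ε).

```python
from fractions import Fraction
import random


def f(a):
    # Hypothetical example function f(a) = (-1)^a / a^2, kept exact as a rational.
    return Fraction((-1) ** a, a * a)


def s(F):
    # Finite partial sum s(F) over a finite set F, as in equation (1).
    return sum((f(a) for a in F), Fraction(0))


A = range(1, 2001)
eps = Fraction(1, 50)
G = set(range(1, 51))            # candidate for G(eps)
outside = [a for a in A if a not in G]

rng = random.Random(0)           # fixed seed, so the run is reproducible
worst = Fraction(0)
for _ in range(200):
    # A random finite subset F of A that is disjoint from G.
    F = rng.sample(outside, rng.randint(1, 100))
    worst = max(worst, abs(s(F)))

# Condition (3): every sampled partial sum outside G has modulus below eps.
cauchy_ok = worst < eps
print(cauchy_ok)
```

Sampling cannot prove the condition, of course; here the tail bound does that, and the code merely confirms it on the sets tried.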


COROLLARY 1.2.3. Let f be a mapping from a set A into a
Banach space X. If f is summable, then $f(a) \to 0$ as $a \to \infty$.

Proof. By taking for F a set with just one member, it follows
from (3) that $\{a \in A : \|f(a)\| \ge \epsilon\}$ is a subset of $G(\epsilon)$ and is therefore
finite, for each positive $\epsilon$.


The preceding discussion applies, in particular, when X is the 


complex field. We now state some results about summability of 


8 COMPACT NON-SELF-ADJOINT OPERATORS 


numerically-valued functions. The proofs are straightforward, and 


are left to the reader. 


LEMMA 1.2.4. Suppose that f is a complex-valued function
defined on a set A, $\mathcal{F}$ is the class of all finite subsets of A, and

$s(F) = \sum_{a \in F} f(a)$ whenever $F \in \mathcal{F}$.

(i) $\sum_{a \in A} f(a)$ exists if and only if $\sum_{a \in A} |f(a)|$ exists; and
when this is so, $\left| \sum_{a \in A} f(a) \right| \le \sum_{a \in A} |f(a)|$.

(ii) If $f(a) \ge 0$ for each a in A, then $\sum_{a \in A} f(a)$ exists if and only
if the set $\{s(F) : F \in \mathcal{F}\}$ is bounded above; and when this is so,

$\sum_{a \in A} f(a) = \sup\{s(F) : F \in \mathcal{F}\}.$

In view of part (ii) of Lemma 1.2.4, we sometimes express the
fact that a non-negative real-valued function f is summable by
writing $\sum_{a \in A} f(a) < \infty$.


LEMMA 1.2.5. Suppose that f is a mapping from a set A into
a Banach space X, and that $\sum_{a \in A} \|f(a)\| < \infty$. Then f is summable
and

$\left\| \sum_{a \in A} f(a) \right\| \le \sum_{a \in A} \|f(a)\|.$

Proof. Since $\sum_{a \in A} \|f(a)\| < \infty$, it follows from Lemma 1.2.2
that there exists $G(\epsilon)$ in $\mathcal{F}$ such that

$\sum_{a \in F} \|f(a)\| < \epsilon$ whenever $F \in \mathcal{F}$ and $F \subseteq A \setminus G(\epsilon)$.

Hence

$\left\| \sum_{a \in F} f(a) \right\| < \epsilon$ whenever $F \in \mathcal{F}$ and $F \subseteq A \setminus G(\epsilon)$;

and, again by Lemma 1.2.2, f is summable. With s(F) defined by
(1), we have

$\|s(F)\| \le \sum_{a \in F} \|f(a)\| \le \sum_{a \in A} \|f(a)\|.$

Since the net $(s, \supseteq)$ converges to $\sum_{a \in A} f(a)$, it follows by taking
limits that

$\left\| \sum_{a \in A} f(a) \right\| \le \sum_{a \in A} \|f(a)\|.$


If A and B are sets, we denote by $A \times B$ the Cartesian product
of A and B; that is, the set of all ordered pairs (a, b) with a in A
and b in B.

LEMMA 1.2.6. Suppose that f is a mapping from the Cartesian
product $A \times B$ of two sets A and B into a Banach space X. If f is
summable, then the repeated sum $\sum_{a \in A} \left( \sum_{b \in B} f(a, b) \right)$ exists, and
is equal to $\sum_{(a, b) \in A \times B} f(a, b)$. If f is a non-negative real-valued
function and the repeated sum exists, then f is summable.

Note. By saying that the repeated sum exists we mean that,
for each a in A, the sum $\sum_{b \in B} f(a, b) = h(a)$ exists, and the mapping
h from A into X is summable. The proof of Lemma 1.2.6 is
straightforward, and is left to the reader.


We conclude this section with a lemma which relates the concept
of unordered summation to the usual notion of convergence of
infinite series. Once again, the proof is left as an exercise for the
reader.

LEMMA 1.2.7. Let f be a mapping from a set A into a Banach
space X, and let $B = \{a \in A : f(a) \ne 0\}$. Then the following two
statements are equivalent:

(i) f is summable.

(ii) B is finite or countable; if it is countable then, given any
enumeration $\{b_j : j = 1, 2, \ldots\}$ of B, the infinite series $\sum_{j=1}^{\infty} f(b_j)$
converges.

When these conditions are satisfied, the series in (ii) converge
to $\sum_{a \in A} f(a)$.


1.3. The spaces $\ell_p(A)$

Given a set A and a positive real number p, we denote by $\ell_p(A)$
the set of all complex-valued functions f on A for which the sum
$\sum_{a \in A} |f(a)|^p$ exists (in the sense of § 1.2). If $f, g \in \ell_p(A)$, then
$f + g \in \ell_p(A)$ and

$\sum_{a \in A} |f(a) + g(a)|^p \le 2^p \left[ \sum_{a \in A} |f(a)|^p + \sum_{a \in A} |g(a)|^p \right].$

To prove this it is sufficient, by Lemma 1.2.4 (ii), to establish the
last inequality in the case where A is replaced by a finite subset
F; and the required result then follows from the elementary
inequalities

$|f(a) + g(a)|^p \le [\max(|2f(a)|, |2g(a)|)]^p \le |2f(a)|^p + |2g(a)|^p.$

Hence $\ell_p(A)$ is a complex vector space, since it is apparent that
$\alpha f \in \ell_p(A)$ whenever $f \in \ell_p(A)$ and $\alpha$ is a complex number.


LEMMA 1.3.1. (Minkowski's inequality). If $1 \le p < \infty$ and
$f, g \in \ell_p(A)$, then

$\left[ \sum_{a \in A} |f(a) + g(a)|^p \right]^{1/p} \le \left[ \sum_{a \in A} |f(a)|^p \right]^{1/p} + \left[ \sum_{a \in A} |g(a)|^p \right]^{1/p}.$

We omit the proof of this result, remarking only that, once it has
been established for summation over a finite set A, the extension
for general A is an immediate consequence of Lemma 1.2.4 (ii).

It follows from Minkowski's inequality that the equation

$\|f\|_p = \left[ \sum_{a \in A} |f(a)|^p \right]^{1/p} \qquad (f \in \ell_p(A))$

defines a norm on $\ell_p(A)$. It is not difficult to verify that, with this
norm, $\ell_p(A)$ is a Banach space when $1 \le p < \infty$ [59: p. 100].

We denote by $\ell_\infty(A)$ the class of all bounded complex-valued
functions on A. This is a complex vector space, and is a Banach
space with respect to the norm $\|\ \|_\infty$ defined by

$\|f\|_\infty = \sup\{|f(a)| : a \in A\} \qquad (f \in \ell_\infty(A)).$

Clearly $\ell_p(A) \subseteq \ell_q(A)$ $(0 < p < q \le \infty)$.

If $p \ge 1$, $q \ge 1$ and $p^{-1} + q^{-1} = 1$, we say that p and q are
conjugate indices, and describe each as the conjugate index of the
other; for example, 1 and $\infty$ are conjugate indices, while 2 is its
own conjugate index.


LEMMA 1.3.2. (Hölder's inequality). If $1 \le p \le \infty$, q is the
conjugate index of p, $f \in \ell_p(A)$ and $g \in \ell_q(A)$, then fg (the pointwise
product, defined by (fg)(a) = f(a)g(a)) is in $\ell_1(A)$ and

$\|fg\|_1 \le \|f\|_p \, \|g\|_q.$


Once again we omit the proof, with the comment that the result
is an immediate consequence of Lemma 1.2.4 (ii), as soon as the
inequality has been established in the case where A is a finite set.

Suppose that p and q are conjugate indices and $g \in \ell_q(A)$. For
each f in $\ell_p(A)$, let

(1) $\phi_g(f) = \sum_{a \in A} f(a) g(a);$

this sum exists, and furthermore $|\phi_g(f)| \le \|f\|_p \, \|g\|_q$, since
$\sum_{a \in A} |f(a) g(a)| \le \|f\|_p \, \|g\|_q$ by Lemma 1.3.2. It follows that $\phi_g$ is
a bounded linear functional on the Banach space $\ell_p(A)$, and that
$\|\phi_g\| \le \|g\|_q$. In fact, $\|\phi_g\| = \|g\|_q$, and we have the following
result [59: p. 194].

THEOREM 1.3.3. Suppose that $1 \le p < \infty$ and q is the
conjugate index of p. When $g \in \ell_q(A)$, let $\phi_g$ be the element of
$(\ell_p(A))^*$ defined by (1). Then the mapping $g \to \phi_g$ is an isometric
isomorphism from $\ell_q(A)$ onto $(\ell_p(A))^*$.
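
When A is finite, Minkowski's and Hölder's inequalities (Lemmas 1.3.1 and 1.3.2) are easy to test numerically. The sketch below is only an illustration under assumed data: the three-point index set, the particular values of `f` and `g`, the conjugate pair p = 3, q = 3/2, and the helper name `p_norm` are all hypothetical choices, not anything prescribed by the text.

```python
def p_norm(f, p):
    # f is a dict from a finite index set A to complex numbers;
    # returns the l_p norm, with p = float("inf") giving the sup norm.
    if p == float("inf"):
        return max(abs(v) for v in f.values())
    return sum(abs(v) ** p for v in f.values()) ** (1.0 / p)


# Hypothetical functions on the index set A = {"a", "b", "c"}.
f = {"a": 3 + 4j, "b": -1.0, "c": 0.5j}
g = {"a": 1 - 2j, "b": 2.0, "c": -3.0}
p, q = 3.0, 1.5  # conjugate indices: 1/3 + 1/1.5 = 1

fg = {k: f[k] * g[k] for k in f}            # pointwise product
f_plus_g = {k: f[k] + g[k] for k in f}      # pointwise sum

# Hoelder: ||fg||_1 <= ||f||_p ||g||_q
holder_lhs = p_norm(fg, 1.0)
holder_rhs = p_norm(f, p) * p_norm(g, q)

# Minkowski: ||f + g||_p <= ||f||_p + ||g||_p
minkowski_lhs = p_norm(f_plus_g, p)
minkowski_rhs = p_norm(f, p) + p_norm(g, p)

print(holder_lhs <= holder_rhs, minkowski_lhs <= minkowski_rhs)
```

The quantity `holder_lhs` also bounds the modulus of the sum in equation (1), which is why the functional defined there is bounded.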


1.4. Analytic operator-valued functions 

Throughout § 1.4, X and Y are complex Banach spaces and
$\mathcal{B}(X, Y)$ is the Banach space of all bounded linear operators from X
into Y.

Suppose that D is an open subset of the complex field and $\phi$ is a
function, defined on D, with values in $\mathcal{B}(X, Y)$. Given f in the dual
space $Y^*$ and x in X, the equation

(1) $\phi_{f,x}(\lambda) = f(\phi(\lambda)x)$

defines a complex-valued function $\phi_{f,x}$ on D. We say that $\phi$ is
analytic on D if, for every f in $Y^*$ and x in X, $\phi_{f,x}$ is analytic on D.
If $\phi$ is defined and analytic on the whole complex field, it is said to
be an entire function.

Many results concerning an operator-valued analytic function $\phi$
can be deduced by applying to each $\phi_{f,x}$ the corresponding theorems


PRELIMINARIES 13 


in classical complex variable theory. The proof of the following 


theorem illustrates this method. 


THEOREM 1.4.1. Suppose that $T_n \in \mathcal{B}(X, Y)$ (n = 0, 1, 2, ...),
r is a positive real number and $D_r$ denotes the disc $\{\lambda : |\lambda| < r\}$.
Then the following three conditions are equivalent.

(i) For each f in $Y^*$, x in X and $\lambda$ in $D_r$, the series $\sum_{n=0}^{\infty} \lambda^n f(T_n x)$
converges.

(ii) $\limsup_{n \to \infty} \|T_n\|^{1/n} \le \frac{1}{r}$.

(iii) For each $\lambda$ in $D_r$, the series $\sum_{n=0}^{\infty} \lambda^n T_n$ converges with respect
to the norm of $\mathcal{B}(X, Y)$.

When these conditions are satisfied, the function defined by

(2) $\phi(\lambda) = \sum_{n=0}^{\infty} \lambda^n T_n$

is analytic on $D_r$ and its expansion in the form (2) is unique.


Proof. Suppose that (i) is satisfied. Given any real number a
such that $0 < a < r$, the series $\sum a^n f(T_n x)$ converges and hence
$a^n f(T_n x) \to 0$ as $n \to \infty$, whenever $f \in Y^*$ and $x \in X$. Thus

$\sup\{|a^n f(T_n x)| : n = 0, 1, 2, \ldots\} < \infty$

and, by the principle of uniform boundedness,

$M = \sup\{\|a^n T_n\| : n = 0, 1, 2, \ldots\} < \infty.$

Hence

$\|T_n\| \le M a^{-n}, \qquad \|T_n\|^{1/n} \le M^{1/n} a^{-1},$

and

$\limsup_{n \to \infty} \|T_n\|^{1/n} \le \frac{1}{a}.$

This has been proved whenever $0 < a < r$, and is therefore true also
when a = r. Hence, (i) implies (ii).

If (ii) is satisfied then, by the elementary theory of numerical
power series, $\sum |\lambda|^n \|T_n\|$ converges whenever $|\lambda| < r$, and so
$\sum \lambda^n T_n$ converges with respect to the norm on $\mathcal{B}(X, Y)$. Hence (ii)
implies (iii).

Suppose that (iii) is satisfied, and $\phi$ is the function defined on
$D_r$ by (2). With $\phi_{f,x}$ as in (1), for each f in $Y^*$ and x in X,

(3) $\phi_{f,x}(\lambda) = f(\phi(\lambda)x) = \sum_{n=0}^{\infty} \lambda^n f(T_n x) \qquad (\lambda \in D_r),$

by (2) and the continuity of the linear functional $T \to f(Tx)$ on
$\mathcal{B}(X, Y)$. This shows that (i) is satisfied, and that all the functions
$\phi_{f,x}$ are analytic on $D_r$. Thus $\phi$ is analytic on $D_r$.

If $\phi$ has another expansion, $\sum \lambda^n S_n$, of the form (2), then $\phi_{f,x}$
has a second expansion $\sum \lambda^n f(S_n x)$ of the form (3). By the
uniqueness of the classical Taylor expansion, $f(T_n x) = f(S_n x)$, for
all f in $Y^*$ and x in X. Thus $T_n = S_n$ (n = 0, 1, 2, ...), and the final
clause of the theorem is proved.


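If $\phi$ is analytic on an open subset D of the complex plane, and
takes values in $\mathcal{B}(X, Y)$, then $\phi$ is continuous on D when $\mathcal{B}(X, Y)$
has the norm topology. To prove this, we have to show that, if
$\lambda_0 \in D$, then $\|\phi(\lambda) - \phi(\lambda_0)\| \to 0$ as $\lambda \to \lambda_0$. Now D contains a
compact disc $D_0$ with centre at $\lambda_0$. For each f in $Y^*$ and x in X,
the function $\phi_{f,x}$ in (1) is analytic on D; its derivative $\phi'_{f,x}$ is
analytic on D and thus bounded on $D_0$. If $\lambda \in D_0$, $\phi_{f,x}(\lambda) - \phi_{f,x}(\lambda_0)$
is the integral of $\phi'_{f,x}$ along the line segment from $\lambda_0$ to $\lambda$; so

$|\phi_{f,x}(\lambda) - \phi_{f,x}(\lambda_0)| \le M_{f,x} |\lambda - \lambda_0|,$

where $M_{f,x}$ is an upper bound of $|\phi'_{f,x}|$ on $D_0$. The last inequality
can be rewritten as

$\left| f\left( \frac{1}{\lambda - \lambda_0} [\phi(\lambda) - \phi(\lambda_0)] x \right) \right| \le M_{f,x} \qquad (\lambda \in D_0, \ \lambda \ne \lambda_0).$

By the principle of uniform boundedness, there is a constant M such
that

$\left\| \frac{1}{\lambda - \lambda_0} [\phi(\lambda) - \phi(\lambda_0)] \right\| \le M \qquad (\lambda \in D_0, \ \lambda \ne \lambda_0).$

Thus $\|\phi(\lambda) - \phi(\lambda_0)\| \le M |\lambda - \lambda_0| \to 0$, as $\lambda \to \lambda_0$.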
Suppose that $\phi$ is an entire function, with values in $\mathcal{B}(X, Y)$. If
$\psi$ is a function which is defined and positive for all complex $\lambda$, we
write $\phi(\lambda) = O(\psi(\lambda))$ if there is a constant M such that
$\|\phi(\lambda)\| \le M \psi(\lambda)$ for all $\lambda$. With this convention, $\phi$ is said to be of
finite order if there is a positive real number $\sigma$ such that

(4) $\phi(\lambda) = O(\exp(|\lambda|^\sigma));$

when this is so, the infimum of the set of all such positive $\sigma$ is
called the order of $\phi$. If $\phi$ has finite order $\rho$, then $\phi$ is said to be of
minimum type if

(5) $\phi(\lambda) = O(\exp(\epsilon |\lambda|^\rho))$

for every positive $\epsilon$. Note that, if $\phi$ satisfies one or other of the
conditions (4), (5), then the same is true of the complex-valued
functions $\phi_{f,x}$ ($f \in Y^*$, $x \in X$) defined by (1).


We conclude this section with two results from classical complex
variable theory which will be needed in later chapters.

Suppose that $\phi$ is a (complex-valued) entire function, with
$\phi(0) \ne 0$, which has finite order $\rho$. The zeros of $\phi$, counted
according to their multiplicities, can be arranged as a sequence
$\lambda_1, \lambda_2, \lambda_3, \ldots$, with

$|\lambda_1| \le |\lambda_2| \le |\lambda_3| \le \cdots.$

It can be shown that

(6) $\sum_{n} |\lambda_n|^{-\alpha} < \infty$

whenever $\alpha > \rho$. If the smallest positive integer $\alpha$ satisfying (6) is
p+1, then p is called the genus of the zeros of $\phi$. The product

(7) $P(\lambda) = \prod_{n=1}^{\infty} \left( 1 - \frac{\lambda}{\lambda_n} \right) \exp\left( \frac{\lambda}{\lambda_n} + \frac{\lambda^2}{2\lambda_n^2} + \cdots + \frac{\lambda^p}{p\lambda_n^p} \right)$

(in which the exponential does not appear if p = 0) converges
uniformly on each bounded subset of the complex field, and is called
the canonical product formed with the zeros of $\phi$ (if $\phi$ has no zeros,
$P(\lambda) = 1$). For these facts, and for a proof of the following result,
we refer to [60: pp. 246-51].


THEOREM 1.4.2. (Hadamard's factorization theorem). If $\phi$
is an entire function with finite order $\rho$, and $\phi(0) \ne 0$, then

$\phi(\lambda) = e^{Q(\lambda)} P(\lambda),$

where $Q(\lambda)$ is a polynomial of degree not greater than $\rho$ and P is the
canonical product formed with the zeros of $\phi$.


THEOREM 1.4.3. Suppose that $\phi$ is an entire function,
$\phi(\lambda) \ne 0$ for all complex $\lambda$, $|\phi(\lambda)| = 1$ when Im $\lambda$ = 0, $|\phi(\lambda)| > 1$ when
Im $\lambda$ > 0, and $\phi(0) = 1$. Then there is a positive real number a such
that $\phi(\lambda) = \exp(-ia\lambda)$.

Proof. Since $\phi'(\lambda)/\phi(\lambda)$ is an entire function, the integral

$g(\lambda) = i \int_0^{\lambda} \frac{\phi'(z)}{\phi(z)} \, dz$

is the same for all rectifiable arcs from 0 to $\lambda$, and g is an entire
function satisfying $g(0) = 0$, $g'(\lambda) = i\phi'(\lambda)/\phi(\lambda)$. The function
$\phi(\lambda) \exp(ig(\lambda))$ has zero derivative, and is therefore identically
equal to $\phi(0) \exp(ig(0))$ (= 1). Thus

$\phi(\lambda) = \exp(-ig(\lambda)).$

The conditions on $|\phi(\lambda)|$ imply that $g(\lambda)$ is real when $\lambda$ is real, and
that Im $g(\lambda)$ > 0 when Im $\lambda$ > 0. If we define

(8) $f_r(\theta) = \mathrm{Im}\, g(re^{i\theta})$

for real $\theta$ and positive r, then

(9) $f_r(\theta) > 0 \qquad (0 < \theta < \pi, \ r > 0).$

The Taylor expansion

(10) $g(\lambda) = a_1 \lambda + a_2 \lambda^2 + \cdots$

of g is valid for all complex $\lambda$. Since $g(\lambda)$ is real when $\lambda$ is real, it
follows that $a_n$ $(= g^{(n)}(0)/n!)$ is real. By (8) and (10),

$f_r(\theta) = a_1 r \sin\theta + a_2 r^2 \sin 2\theta + \cdots.$

For a fixed positive r, this series converges uniformly for $\theta$ in the
compact interval $[0, \pi]$, so

(11) $a_n r^n = \frac{2}{\pi} \int_0^{\pi} f_r(\theta) \sin n\theta \, d\theta.$

It is easily proved, by induction on n, that

$|\sin n\theta| \le n \sin\theta \qquad (0 \le \theta \le \pi, \ n = 1, 2, \ldots).$

This, with (9) and (11), gives

$|a_n| r^n \le \frac{2}{\pi} \int_0^{\pi} f_r(\theta) |\sin n\theta| \, d\theta \le \frac{2n}{\pi} \int_0^{\pi} f_r(\theta) \sin\theta \, d\theta = n a_1 r.$

Thus $|a_n| \le n a_1 r^{1-n}$; by letting $r \to \infty$, we have $a_n = 0$ (n > 1). It
follows from (10) that $g(\lambda) = a_1 \lambda$, and so $\phi(\lambda) = \exp(-ia_1\lambda)$. Since
$|\phi(\lambda)| > 1$ when Im $\lambda$ > 0, we have $a_1 > 0$.
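
The elementary inequality $|\sin n\theta| \le n \sin\theta$ on $[0, \pi]$, which drives the estimate above, can be spot-checked numerically. The following Python sketch is an illustration only; the function name `sine_bound_holds`, the grid size and the floating-point tolerance are assumptions made for the example, and a grid check is of course no substitute for the induction argument.

```python
import math


def sine_bound_holds(n_max=20, grid=1000, tol=1e-12):
    # Check |sin(n*theta)| <= n*sin(theta) for n = 1..n_max on a uniform
    # grid of theta values in [0, pi]; tol absorbs rounding error near the
    # endpoints, where both sides are (numerically) close to zero.
    for n in range(1, n_max + 1):
        for k in range(grid + 1):
            theta = math.pi * k / grid
            if abs(math.sin(n * theta)) > n * math.sin(theta) + tol:
                return False
    return True


ok = sine_bound_holds()
print(ok)
```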


1.5. Elementary spectral theory 
Throughout § 1.5, X is a complex Banach space and I denotes
the identity operator on X. We write $\mathcal{B}(X)$, instead of $\mathcal{B}(X, X)$, for
the Banach space of all bounded linear operators from X into itself.
If $S, T \in \mathcal{B}(X)$, then $ST \in \mathcal{B}(X)$ and $\|ST\| \le \|S\| \, \|T\|$. Thus $\mathcal{B}(X)$
is an associative linear algebra; furthermore, for a fixed S in $\mathcal{B}(X)$,
the mappings $T \to ST$ and $T \to TS$ are bounded linear operators on
the Banach space $\mathcal{B}(X)$.

The spectral radius r(T) of an element T of $\mathcal{B}(X)$ is defined by
$r(T) = \inf\{\|T^n\|^{1/n} : n = 1, 2, \ldots\}$. It is clear that

(1) $0 \le r(T) \le \|T\|;$

we assert also that

(2) $r(T) = \lim_{n \to \infty} \|T^n\|^{1/n}.$

To prove (2), choose any positive integer k and let $\alpha = \|T^k\|^{1/k}$.
Each n = 1, 2, ... can be expressed as $n = q(n)k + r(n)$, where
$q(n) \ge 0$ and $0 \le r(n) < k$. Then

$\|T^n\| = \|(T^k)^{q(n)} T^{r(n)}\| \le \|T^k\|^{q(n)} \|T\|^{r(n)} = \alpha^{k q(n)} \|T\|^{r(n)} = \alpha^{n - r(n)} \|T\|^{r(n)}.$

Thus $\|T^n\|^{1/n} \le \alpha^{1 - r(n)/n} \|T\|^{r(n)/n}$ and, since r(n) is bounded,

$\limsup_{n \to \infty} \|T^n\|^{1/n} \le \alpha = \|T^k\|^{1/k}.$

It follows that

$\limsup_{n \to \infty} \|T^n\|^{1/n} \le \inf\{\|T^k\|^{1/k} : k = 1, 2, \ldots\} = r(T) \le \liminf_{n \to \infty} \|T^n\|^{1/n},$

and (2) is proved.
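
Formula (2) can be watched at work in finite dimensions. The sketch below is an illustration under assumptions that are not in the text: X is taken to be the plane with the supremum norm (so that the induced operator norm is the maximum absolute row sum), and T is a hypothetical upper triangular 2 × 2 matrix with both diagonal entries equal to 0.5. The helper names `matmul`, `row_sum_norm` and `root_norms` are likewise invented for the example.

```python
def matmul(A, B):
    # Product of two square matrices given as lists of rows.
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def row_sum_norm(A):
    # Operator norm induced by the sup norm: maximum absolute row sum.
    return max(sum(abs(x) for x in row) for row in A)


def root_norms(T, n_max):
    # Returns the list of ||T^n||^(1/n) for n = 1, ..., n_max.
    n = len(T)
    P = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    out = []
    for m in range(1, n_max + 1):
        P = matmul(P, T)
        out.append(row_sum_norm(P) ** (1.0 / m))
    return out


T = [[0.5, 1.0], [0.0, 0.5]]   # hypothetical example operator
vals = root_norms(T, 200)
print(vals[0], vals[-1])       # first and last terms of the sequence
```

For this T the norm is 1.5, yet the terms $\|T^n\|^{1/n}$ drift down towards 0.5, illustrating that the inequality $r(T) \le \|T\|$ in (1) can be strict.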
By Theorem 1.4.1 and equation (2), the series $\sum \lambda^n T^n$ converges,
with respect to the norm on $\mathcal{B}(X)$, whenever $|\lambda| r(T) < 1$; this is so,
in particular, if $|\lambda| \, \|T\| < 1$. Since multiplication by $\lambda T$ is a
bounded linear operator on $\mathcal{B}(X)$, the operator

$S = \sum_{n=0}^{\infty} \lambda^n T^n$

satisfies

$\lambda T S = S \lambda T = \sum_{n=1}^{\infty} \lambda^n T^n = S - I,$

$(I - \lambda T) S = S (I - \lambda T) = I.$

Hence $I - \lambda T$ has an inverse in $\mathcal{B}(X)$ when $|\lambda| r(T) < 1$, and

(3) $(I - \lambda T)^{-1} = \sum_{n=0}^{\infty} \lambda^n T^n \qquad (|\lambda| r(T) < 1).$
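
The series expansion (3) can be tried out on a small matrix. The following Python sketch is an illustration only: the 2 × 2 matrix T, the value λ = 0.5 and the helper names `matmul`, `identity` and `neumann_partial_sum` are assumptions made for the example. It forms a partial sum of the series and checks that multiplying by I − λT gives (numerically) the identity.

```python
def matmul(A, B):
    # Product of two square matrices given as lists of rows.
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def neumann_partial_sum(T, lam, N):
    # Returns the partial sum of lam^n T^n over n = 0, ..., N.
    n = len(T)
    S = identity(n)
    term = identity(n)
    for _ in range(N):
        term = [[lam * x for x in row] for row in matmul(term, T)]
        S = [[S[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return S


T = [[0.5, 1.0], [0.0, 0.5]]   # hypothetical operator with r(T) = 0.5
lam = 0.5                      # |lam| r(T) = 0.25 < 1
S = neumann_partial_sum(T, lam, 60)

I2 = identity(2)
I_minus_lamT = [[I2[i][j] - lam * T[i][j] for j in range(2)] for i in range(2)]
product = matmul(I_minus_lamT, S)

# Largest entry of (I - lam T) S - I; it should be at rounding-error level.
residual = max(abs(product[i][j] - I2[i][j])
               for i in range(2) for j in range(2))
print(residual < 1e-12)
```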


It is frequently convenient to make a change of variable,
$\lambda = \mu^{-1}$, and to consider the existence of $(\mu I - T)^{-1}$. The resolvent
set of T is the set $\rho(T)$ of all complex numbers $\mu$ for which $\mu I - T$
has an inverse in $\mathcal{B}(X)$; its complement is called the spectrum of T
and is denoted by $\sigma(T)$. The set $\sigma_p(T)$, consisting of all complex
numbers $\mu$ such that $\mu I - T$ is not one to one, is a subset of $\sigma(T)$ and
is called the point spectrum of T; elements of $\sigma_p(T)$ are called
eigenvalues of T. If $\mu \in \sigma_p(T)$, n is a positive integer, and x is a
non-zero element of X which satisfies $(\mu I - T)^n x = 0$, then x is
called a principal vector of T associated with the eigenvalue $\mu$; in
the case n = 1, x is described as an eigenvector of T. The
operator-valued function $R(\mu, T)$, defined on $\rho(T)$ by

$R(\mu, T) = (\mu I - T)^{-1},$

is called the resolvent of T. It follows from (3) that

(4) $\mu \in \rho(T), \qquad R(\mu, T) = \sum_{n=0}^{\infty} \mu^{-n-1} T^n \qquad (|\mu| > r(T));$

and so

(5) $\sigma(T) \subseteq \{\mu : |\mu| \le r(T)\}.$

THEOREM 1.5.1. Suppose that X is a complex Banach space
(not consisting of the zero vector only) and $T \in \mathcal{B}(X)$. Then $\sigma(T)$ is
a compact non-empty subset of the complex plane, and

(6) $\max\{|\mu| : \mu \in \sigma(T)\} = r(T).$

The resolvent set $\rho(T)$ is open, and $R(\mu, T)$ is analytic on $\rho(T)$.

Proof. If $\mu_0 \in \rho(T)$ and $R(\mu_0, T) = R_0$, then

$\mu I - T = \mu_0 I - T + (\mu - \mu_0) I = (\mu_0 I - T)(I + (\mu - \mu_0) R_0).$

If $|\mu - \mu_0| \, \|R_0\| < 1$, then $I + (\mu - \mu_0) R_0$ has an inverse, $\sum (\mu_0 - \mu)^n R_0^n$;
thus $\mu I - T$ has an inverse $R(\mu, T)$ given by

$R(\mu, T) = \sum_{n=0}^{\infty} (\mu_0 - \mu)^n R_0^{n+1},$

and so $\mu \in \rho(T)$. It follows that $\rho(T)$ contains the disc
$\{\mu : |\mu - \mu_0| \, \|R_0\| < 1\}$, and that $R(\mu, T)$ is analytic on this disc.
Hence $\rho(T)$ is open, and $R(\mu, T)$ is analytic on $\rho(T)$.

The complement $\sigma(T)$ of $\rho(T)$ is closed; it is bounded, by (5),
and is therefore compact. Let r be any real number such that

(7) $r > 0, \qquad \sigma(T) \subseteq \{\mu : |\mu| < r\}.$

We shall prove that these conditions imply that $r \ge r(T)$. If $\lambda$ is a
complex number such that $0 < |\lambda| < r^{-1}$ then $|\lambda^{-1}| > r$ and, by
(7), $\lambda^{-1} \in \rho(T)$. Hence $I - \lambda T$ has an inverse,

$(I - \lambda T)^{-1} = \lambda^{-1} (\lambda^{-1} I - T)^{-1} = \lambda^{-1} R(\lambda^{-1}, T).$

This shows that $(I - \lambda T)^{-1}$ exists and is analytic throughout the
punctured disc $\{\lambda : 0 < |\lambda| < r^{-1}\}$; by (3), it is also analytic on a
neighbourhood of 0, and is therefore analytic on the whole of the
disc $\{\lambda : |\lambda| < r^{-1}\}$. For each f in $X^*$ and x in X, the
complex-valued function $\phi_{f,x}(\lambda) = f((I - \lambda T)^{-1} x)$ is analytic when
$|\lambda| < r^{-1}$, so its Taylor expansion, which by (3) is $\sum \lambda^n f(T^n x)$,
converges when $|\lambda| < r^{-1}$. From the equivalence of conditions (i)
and (ii) in Theorem 1.4.1, it follows that

$r \ge \limsup_{n \to \infty} \|T^n\|^{1/n} = r(T).$

This shows that, if r satisfies (7), then $r \ge r(T)$.

Suppose that $\sigma(T)$ is empty. Given any positive r, (7) is
satisfied and therefore $r(T) \le r$; thus $r(T) = 0$. However, since
$0 \notin \sigma(T)$, T has an inverse S in $\mathcal{B}(X)$. Since X does not consist of
the zero vector only,

$1 = \|I\| = \|S^n T^n\| \le \|S^n\| \, \|T^n\| \le \|S\|^n \, \|T^n\|,$

and so $\|T^n\|^{1/n} \ge \|S\|^{-1}$ (n = 1, 2, ...). Thus $r(T) \ge \|S\|^{-1} > 0$,
contradicting our previous conclusion that $r(T) = 0$. This shows
that $\sigma(T)$ is not empty. By (5),

(8) $\max\{|\mu| : \mu \in \sigma(T)\} \le r(T).$

If strict inequality occurs, we can choose r so that (7) is satisfied
and $r < r(T)$, contradicting our previous conclusion that $r \ge r(T)$ for
each such r. Hence, equality occurs in (8).

An element T of $\mathcal{B}(X)$ is said to be quasi-nilpotent if $r(T) = 0$;
by Theorem 1.5.1, this occurs if and only if $\sigma(T)$ consists of the
single point 0.

If $T \in \mathcal{B}(X)$, and $p(\mu) = a_0 + a_1 \mu + \cdots + a_n \mu^n$ is a polynomial with
complex coefficients, then we denote by p(T) the element
$a_0 I + a_1 T + \cdots + a_n T^n$ of $\mathcal{B}(X)$.

THEOREM 1.5.2. (Spectral mapping theorem). If X is a
complex Banach space, $T \in \mathcal{B}(X)$ and p is a polynomial with complex
coefficients, then

$\sigma(p(T)) = \{p(\mu) : \mu \in \sigma(T)\}.$

Proof. We begin by remarking that, if $S_i \in \mathcal{B}(X)$ and
$S_i S_j = S_j S_i$ (i, j = 1, ..., n), then the product $S_1 S_2 \cdots S_n$ has an
inverse if and only if each $S_i$ has an inverse; the proof is
straightforward, and is left to the reader. The result of the theorem
is obvious if p is a constant, so we may assume that

$p(\mu) = a_0 + a_1 \mu + \cdots + a_n \mu^n,$

where $n \ge 1$ and $a_n \ne 0$. If $\mu_0 \in \sigma(T)$, then

$p(\mu_0) I - p(T) = \sum_{k=1}^{n} a_k (\mu_0^k I - T^k) = (\mu_0 I - T) q(T)$

for some polynomial q. Since $\mu_0 I - T$ has no inverse, neither does
$p(\mu_0) I - p(T)$, and therefore $p(\mu_0) \in \sigma(p(T))$.

Next, suppose that $\nu$ is a complex number, not of the form $p(\mu)$
with $\mu$ in $\sigma(T)$. Then $p(\mu) - \nu$ can be factorized as

$p(\mu) - \nu = a_n (\mu - \mu_1)(\mu - \mu_2) \cdots (\mu - \mu_n),$

where none of the complex numbers $\mu_1, \ldots, \mu_n$ is in $\sigma(T)$. It follows
that

$p(T) - \nu I = a_n (T - \mu_1 I)(T - \mu_2 I) \cdots (T - \mu_n I);$

and since the (commuting) operators on the right-hand side all have
inverses, so does $\nu I - p(T)$. Thus, $\nu \notin \sigma(p(T))$.

Suppose that $T \in \mathcal{B}(X)$. By a closed invariant subspace of T we
mean a closed subspace M of X with the property that $Tx \in M$
whenever $x \in M$; we also describe this property by saying that M is
invariant under T. There are at least two such subspaces, the
trivial ones {0} and X; any other ones are described as proper
closed invariant subspaces. If X has finite dimension greater than
1, the existence of an eigenvector of T ensures that T has a proper
closed invariant subspace. For infinite-dimensional Banach spaces
(even Hilbert spaces) it is not known whether or not each T in $\mathcal{B}(X)$
has a proper closed invariant subspace, although affirmative results
are known for certain classes of operators.


If $T \in \mathcal{B}(X)$, then $\rho(T)$ is open and therefore each of its
connected components is open; since $\rho(T)$ has compact complement,
exactly one of its connected components, which we denote by
$\rho_\infty(T)$, is unbounded.

THEOREM 1.5.3. Suppose that X is a complex Banach space,
$T \in \mathcal{B}(X)$ and M is a closed invariant subspace of T. Then M is
invariant under $(\mu I - T)^{-1}$ whenever $\mu \in \rho_\infty(T)$, the unbounded
connected component of $\rho(T)$.


Proof. Let $M^0$ be the closed subspace of $X^*$ defined as in
1.1(2). Given f in $M^0$ and x in M, the function $\phi$ defined by

$\phi(\mu) = f((\mu I - T)^{-1} x)$

is analytic on $\rho(T)$. Since $T^n x \in M$, it follows that $f(T^n x) = 0$
(n = 0, 1, 2, ...). By (4), $\mu \in \rho_\infty(T)$ and

$\phi(\mu) = \sum_{n=0}^{\infty} \mu^{-n-1} f(T^n x) = 0$

whenever $|\mu| > r(T)$. Since $\phi$ vanishes on a non-empty open subset
of the connected set $\rho_\infty(T)$, it follows by analytic continuation that
$\phi$ vanishes throughout $\rho_\infty(T)$.

We have shown that, if $x \in M$ and $\mu \in \rho_\infty(T)$, then

$f((\mu I - T)^{-1} x) = 0 \qquad (f \in M^0).$

Thus $(\mu I - T)^{-1} x \in (M^0)_0 = M$.


If $T \in \mathcal{B}(X)$, M is a closed invariant subspace of T, and $T^M$
denotes the restriction of T to M, then $T^M \in \mathcal{B}(M)$ and $\|T^M\| \le \|T\|$.
For x in X, let [x] be the coset x+M ($\in$ X/M). If [x] = [y], then
$x - y \in M$, $Tx - Ty \in M$, and thus [Tx] = [Ty]. It follows that the
mapping $T_M : [y] \to [Ty]$ is well-defined; and it is readily verified
that $T_M \in \mathcal{B}(X/M)$ and $\|T_M\| \le \|T\|$.


1.6. Hilbert spaces 

We assume that the reader is familiar with the elementary theory 
of inner product spaces (see, for example, [26: Chapter 3]).

Let H be a complex inner product space, and denote by $\langle x, y \rangle$
the inner product of vectors x and y in H. The equation

(1) $\|x\| = \langle x, x \rangle^{1/2}$

defines a norm on H, and the Cauchy-Schwarz inequality asserts
that

(2) $|\langle x, y \rangle| \le \|x\| \, \|y\| \qquad (x, y \in H).$

This implies that the inner product is (jointly) continuous in both
variables, with respect to the norm topology on H. In particular, for
each fixed y in H, the mapping $x \to \langle x, y \rangle$ is a bounded linear
functional on H.

A complex inner product space which is complete with respect
to the norm in (1) is called a Hilbert space. We emphasize that, in
this book, the term Hilbert space is used only in connection with
complex vector spaces.




If H is the Banach space $\ell_2(A)$ described in § 1.3, the equation

$\langle f, g \rangle = \sum_{a \in A} f(a) \overline{g(a)} \qquad (f, g \in \ell_2(A))$

defines an inner product on H; the sum exists, by Lemma 1.3.2.
The norm derived from this inner product, as in (1), is the usual one,
$\|\ \|_2$ on $\ell_2(A)$. Thus $\ell_2(A)$ is a Hilbert space.

Suppose that $\mathcal{H}$ is a Hilbert space, $A$ and $B$ are subsets of $\mathcal{H}$,
and $x, y \in \mathcal{H}$. We say that $x$ is orthogonal to $y$ if $\langle x, y\rangle = 0$. If
$\langle x, b\rangle = 0$ whenever $b \in B$, we say that $x$ is orthogonal to $B$; the
set of all vectors which are orthogonal to $B$ is denoted by $B^\perp$. If
$\langle a, b\rangle = 0$ whenever $a \in A$ and $b \in B$, we say that $A$ is orthogonal
to $B$.

Since the mapping $x \to \langle x, b\rangle$ is a bounded linear functional on
$\mathcal{H}$, for each $b$, it is easily verified that $B^\perp$ is a closed subspace of
$\mathcal{H}$. Let $M$ be the closed subspace generated by $B$. If $x \in B^\perp$, then
$\langle x, b\rangle = 0$ for each $b$ in $B$, and the conjugate linearity and
continuity of the mapping $c \to \langle x, c\rangle$ imply that $\langle x, c\rangle = 0$ for
each $c$ in $M$. Thus $B^\perp \subseteq M^\perp$; since the reverse inclusion is
apparent, $B^\perp = M^\perp$. Note also that $B \cap B^\perp = \{0\}$ since, if
$x \in B \cap B^\perp$, we have $0 = \langle x, x\rangle = \|x\|^2$.

Suppose that $\mathcal{H}$ is a Hilbert space and $A \subseteq \mathcal{H}$. We say that $A$ is
an orthogonal set if $\langle a, b\rangle = 0$ whenever $a, b \in A$ and $a \ne b$; if, in
addition, $\|a\| = 1$ for each $a$ in $A$, we say that $A$ is an orthonormal
set. An orthonormal set $A$ is linearly independent (by which we
mean that every finite subset is linearly independent); for if
$a_1, \ldots, a_n \in A$, $\alpha_1, \ldots, \alpha_n$ are complex numbers and
$\sum_{j=1}^{n} \alpha_j a_j = 0$, then

    $0 = \langle \sum_{j=1}^{n} \alpha_j a_j, a_k \rangle = \alpha_k$

for each $k = 1, \ldots, n$.

In the following lemmas, we make use of the concept of unordered
summation, which was described in § 1.2.


LEMMA 1.6.1. Suppose that $A$ is an orthogonal set in a Hilbert
space $\mathcal{H}$.

(i) $\sum_{a \in A} a$ exists if and only if $\sum_{a \in A} \|a\|^2 < \infty$. When this is so,

    $\|\sum_{a \in A} a\|^2 = \sum_{a \in A} \|a\|^2$.

(ii) If $A$ is orthonormal and $f$ is a complex-valued function on $A$,
then $\sum_{a \in A} f(a)a$ exists if and only if $\sum_{a \in A} |f(a)|^2 < \infty$. When this is
so,

    $\|\sum_{a \in A} f(a)a\|^2 = \sum_{a \in A} |f(a)|^2$.


Proof. It suffices to prove (i), since (ii) then follows at once.
For any finite subset $F$ of $A$,

(3)  $\|\sum_{a \in F} a\|^2 = \langle \sum_{a \in F} a, \sum_{b \in F} b \rangle
        = \sum_{a, b \in F} \langle a, b\rangle = \sum_{a \in F} \|a\|^2$.

It now follows, from the Cauchy criterion (Lemma 1.2.2), that $\sum_{a \in A} a$
exists if and only if $\sum_{a \in A} \|a\|^2 < \infty$. When this is so, the infinite
sums in (i) are limits of the appropriate nets of finite subsums;
since the norm is a continuous function on $\mathcal{H}$, it follows from (3)
that

    $\|\sum_{a \in A} a\|^2 = \sum_{a \in A} \|a\|^2$.

LEMMA 1.6.2. If $A$ is an orthonormal set in a Hilbert space $\mathcal{H}$,
and $x \in \mathcal{H}$, then

(i) $\sum_{a \in A} |\langle x, a\rangle|^2 \le \|x\|^2$ (Bessel's inequality);

(ii) the sum $x_1 = \sum_{a \in A} \langle x, a\rangle a$ exists, and $x - x_1 \in A^\perp$;

(iii) $\|x - x_1\|^2 = \|x\|^2 - \sum_{a \in A} |\langle x, a\rangle|^2$.


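Proof. If $F$ is a finite subset of $A$, and $g(a) = \langle x, a\rangle$ for each
$a$ in $A$, then

    $\|x - \sum_{a \in F} \langle x, a\rangle a\|^2
        = \langle x - \sum_{a \in F} g(a)a, \; x - \sum_{b \in F} g(b)b \rangle$

    $   = \|x\|^2 - \sum_{a \in F} g(a)\langle a, x\rangle
          - \sum_{b \in F} \overline{g(b)}\langle x, b\rangle
          + \sum_{a, b \in F} g(a)\overline{g(b)}\langle a, b\rangle$

    $   = \|x\|^2 - \sum_{a \in F} |\langle x, a\rangle|^2$.

Thus, for each finite subset $F$ of $A$,

(4)  $\sum_{a \in F} |\langle x, a\rangle|^2
        = \|x\|^2 - \|x - \sum_{a \in F} \langle x, a\rangle a\|^2 \le \|x\|^2$.

This proves (i), and the existence of the sum $x_1$ in (ii) now follows
from Lemma 1.6.1. Since the mapping $y \to \langle y, b\rangle$ is a bounded
linear functional on $\mathcal{H}$, for each $b$ in $A$, we have

    $\langle x_1, b\rangle = \sum_{a \in A} \langle x, a\rangle \langle a, b\rangle = \langle x, b\rangle$  $(b \in A)$,

so $x - x_1 \in A^\perp$. This proves (ii); and (iii) follows from (4) since the
norm is a continuous function on $\mathcal{H}$.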
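
As a numerical illustration of Lemma 1.6.2 (not part of the original text), the following Python sketch checks Bessel's inequality and parts (ii) and (iii) in the finite-dimensional Hilbert space C^3. The orthonormal set `A`, the vector `x`, and the helper names `inner` and `norm_sq` are assumptions chosen only for this example.

```python
def inner(x, y):
    """<x, y>: linear in x and conjugate-linear in y."""
    return sum(a * b.conjugate() for a, b in zip(x, y))


def norm_sq(x):
    """||x||^2 = <x, x>."""
    return inner(x, x).real


s = 2 ** -0.5
# Hypothetical orthonormal set A = {a1, a2} in C^3 (deliberately not a basis).
A = [(s, s * 1j, 0j), (0j, 0j, 1 + 0j)]
x = (1 + 2j, 3 - 1j, 0.5j)

coeffs = [inner(x, a) for a in A]               # g(a) = <x, a>
bessel_sum = sum(abs(c) ** 2 for c in coeffs)   # sum over A of |<x, a>|^2

# x1 = sum over A of <x, a> a, and the residual x - x1.
x1 = tuple(sum(c * a[i] for c, a in zip(coeffs, A)) for i in range(3))
residual = tuple(xi - yi for xi, yi in zip(x, x1))

# (i) Bessel's inequality, (ii) x - x1 lies in A-perp, (iii) norm identity.
bessel_holds = bessel_sum <= norm_sq(x)
residual_orthogonal = all(abs(inner(residual, a)) < 1e-12 for a in A)
identity_holds = abs(norm_sq(residual) - (norm_sq(x) - bessel_sum)) < 1e-12
print(bessel_holds, residual_orthogonal, identity_holds)
```

Because `A` is not a basis of C^3, the inequality in (i) is strict for this `x`; enlarging `A` to an orthonormal basis would turn it into the equality of Theorem 1.6.3(iv).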


THEOREM 1.6.3. If $A$ is an orthonormal set in a Hilbert space
$\mathcal{H}$, the following five conditions are equivalent.

(i) $A^\perp = \{0\}$.

(ii) For each $x$ in $\mathcal{H}$, $x = \sum_{a \in A} \langle x, a\rangle a$.

(iii) For each $x$ and $y$ in $\mathcal{H}$, $\langle x, y\rangle = \sum_{a \in A} \langle x, a\rangle \langle a, y\rangle$.

(iv) For each $x$ in $\mathcal{H}$, $\|x\|^2 = \sum_{a \in A} |\langle x, a\rangle|^2$.

(v) The closed subspace generated by $A$ is the whole of $\mathcal{H}$.


Proof. Suppose that (i) is satisfied. If $x \in \mathcal{H}$ then, by Lemma
1.6.2(ii), the sum $\sum_{a \in A} \langle x, a\rangle a$ exists, and

    $x - \sum_{a \in A} \langle x, a\rangle a \in A^\perp = \{0\}$.

Hence, (i) implies (ii). Since the mapping $z \to \langle z, y\rangle$ is a bounded
linear functional on $\mathcal{H}$, it is apparent that (ii) implies (iii). By taking
$y = x$ in (iii), it follows that (iii) implies (iv).

Suppose that (iv) is satisfied. Given $x$ in $\mathcal{H}$, Lemma 1.6.2(iii)
implies that $x = \sum_{a \in A} \langle x, a\rangle a$, so $x$ lies in the closed subspace $M$
generated by $A$. Thus $M = \mathcal{H}$; so (iv) implies (v). Finally, if (v) is
satisfied, then $A^\perp = \mathcal{H}^\perp = \{0\}$; so (v) implies (i).


Let $\mathcal{H}$ be a Hilbert space and suppose (to avoid trivialities) that
$\mathcal{H}$ does not consist of the zero vector only. Then there exist
non-empty orthonormal sets $A_0$ in $\mathcal{H}$. A simple argument based on
Zorn's Lemma shows that each such $A_0$ is contained in an
orthonormal set $A$ which is maximal, in the sense that $A$ is not a
proper subset of any orthonormal set $B$. It is easily verified that an
orthonormal set $A$ is maximal if and only if $A^\perp = \{0\}$. By Theorem
1.6.3, $A$ is maximal if and only if it satisfies any one (and hence all)
of the conditions (i), ..., (v) listed in that theorem. In view of (ii),
a maximal orthonormal set in $\mathcal{H}$ is called an orthonormal basis of $\mathcal{H}$;
the equality in (iii) is known as Parseval's equation.

In the preceding paragraph we have proved the following result.

THEOREM 1.6.4. If $\mathcal{H}$ is a Hilbert space which does not
consist of the zero vector only, then $\mathcal{H}$ has an orthonormal basis.
Furthermore, every orthonormal set in $\mathcal{H}$ is contained in an
orthonormal basis.


The dimension of a Hilbert space $\mathcal{H}$, denoted by $\dim(\mathcal{H})$, is
defined to be the cardinal number of an orthonormal basis $A$ of $\mathcal{H}$; it
can be shown [59: p. 117] that this does not depend on the choice
of $A$. Hilbert spaces with the same dimension are isometrically
isomorphic; separable infinite-dimensional Hilbert spaces have
countable orthonormal bases and are isomorphic to $l_2(N)$, where $N$
is the set of all positive integers [59: pp. 120, 121].

If $\mathcal{H}$ is a Hilbert space and $J$ is any set with cardinal number
equal to $\dim(\mathcal{H})$, then every orthonormal basis of $\mathcal{H}$ can be indexed
by $J$, and so expressed in the form $\{\phi_j : j \in J\}$. If $A$ is an
orthonormal set in $\mathcal{H}$ then, since $A$ is contained in an orthonormal
basis, it can be expressed as $\{\phi_k : k \in K\}$, where $K$ is a subset of
$J$. Of course, this can be done in many ways: when an orthonormal
set is given in this form, we shall use the term 'orthonormal system'
to indicate that a particular indexing is specified. We shall reserve
the phrase 'orthonormal sequence' for the case in which $K$ is
$\{1, 2, 3, \ldots\}$, or $\{1, 2, \ldots, p\}$ for some $p$.


Our next theorem concerns the Schmidt orthogonalisation process
for constructing orthonormal sequences. If $X$ is a normed space and
$x_1, \ldots, x_n \in X$, we denote by $\mathcal{L}(x_1, \ldots, x_n)$ the subspace
(necessarily closed) generated by $x_1, \ldots, x_n$.

THEOREM 1.6.5. Suppose that $(x_1, x_2, x_3, \ldots)$ is a linearly
independent sequence of vectors in a Hilbert space $\mathcal{H}$. Then there
is an orthonormal sequence $(y_1, y_2, y_3, \ldots)$ in $\mathcal{H}$ such that
$\mathcal{L}(y_1, \ldots, y_n) = \mathcal{L}(x_1, \ldots, x_n)$ for each $n$.


Proof. We construct $y_1, y_2, \ldots$ inductively, and begin by
defining $y_1 = \|x_1\|^{-1} x_1$. Now suppose that we have chosen an
orthonormal system $(y_1, \ldots, y_{r-1})$ such that

(5)  $\mathcal{L}(y_1, \ldots, y_n) = \mathcal{L}(x_1, \ldots, x_n)$  $(n = 1, \ldots, r-1)$.

Since $x_r \notin \mathcal{L}(x_1, \ldots, x_{r-1}) = \mathcal{L}(y_1, \ldots, y_{r-1})$, the vector

(6)  $z_r = x_r - \sum_{j=1}^{r-1} \langle x_r, y_j\rangle y_j$

is non-zero, and is clearly orthogonal to $y_1, \ldots, y_{r-1}$. It follows
that, if $y_r = \|z_r\|^{-1} z_r$, then $(y_1, \ldots, y_r)$ is an orthonormal system.
By (5), each of $y_1, \ldots, y_{r-1}$ is a linear combination of $x_1, \ldots, x_r$,
and (6) implies that the same is true of $z_r$, hence also of $y_r$. Thus

    $\mathcal{L}(y_1, \ldots, y_r) \subseteq \mathcal{L}(x_1, \ldots, x_r)$;

since both these subspaces are $r$-dimensional, they are equal.

In this way, we construct inductively an orthonormal sequence
$(y_1, y_2, \ldots)$ such that (5) is satisfied for all $n$. If there are just $m$
vectors $x_1, \ldots, x_m$ in the given linearly independent sequence, the
process terminates with the construction of $y_m$; if the sequence $(x_n)$
is infinite, then so is $(y_n)$.
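
The inductive construction in the proof of Theorem 1.6.5 can be sketched in a few lines of Python for vectors in C^n. This is only an illustration: the function name `gram_schmidt`, the tolerance `tol`, and the sample vectors `xs` are assumptions made here, and the code follows formula (6) literally (the classical form of the process) rather than any numerically stabilised variant.

```python
def inner(x, y):
    """<x, y>: linear in x and conjugate-linear in y."""
    return sum(a * b.conjugate() for a, b in zip(x, y))


def gram_schmidt(xs, tol=1e-12):
    """Orthonormalise a linearly independent list of vectors in C^n."""
    ys = []
    for x in xs:
        # z_r = x_r - sum_j <x_r, y_j> y_j, as in (6).
        z = [complex(c) for c in x]
        for y in ys:
            c = inner(x, y)
            z = [zi - c * yi for zi, yi in zip(z, y)]
        n = inner(z, z).real ** 0.5
        if n < tol:
            # z_r = 0 means x_r already lies in L(x_1, ..., x_{r-1}).
            raise ValueError("vectors are not linearly independent")
        ys.append([zi / n for zi in z])   # y_r = z_r / ||z_r||
    return ys


# Hypothetical linearly independent vectors in C^3.
xs = [(1, 1j, 0), (1, 0, 1), (0, 1, 1)]
ys = gram_schmidt(xs)

# Gram matrix of the output; it should be the identity up to rounding.
gram = [[inner(u, v) for v in ys] for u in ys]
```

Each `xs[r]` can be recovered as the sum of `inner(xs[r], ys[j]) * ys[j]` over `j <= r`, which is the statement that the two spans in (5) agree.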


THEOREM 1.6.6. If $M$ is a closed subspace of a Hilbert space
$\mathcal{H}$, then $M \cap M^\perp = \{0\}$, $M + M^\perp = \mathcal{H}$ and $(M^\perp)^\perp = M$.

Proof. We have already noted that $B \cap B^\perp = \{0\}$ for any
non-empty subset $B$ of $\mathcal{H}$. Let $A$ be an orthonormal basis of the
Hilbert space $M$ (hence an orthonormal set in $\mathcal{H}$). Since $A$ generates
the closed subspace $M$, we have $A^\perp = M^\perp$. Given $x$ in $\mathcal{H}$, the sum
$x_1 = \sum_{a \in A} \langle x, a\rangle a$ exists and $x - x_1 \in A^\perp = M^\perp$, by Lemma 1.6.2.
Clearly, $x_1 \in M$, so

    $x = x_1 + (x - x_1) \in M + M^\perp$.

This shows that $M + M^\perp = \mathcal{H}$.

It is apparent that $M \subseteq (M^\perp)^\perp$, so $x_1 \in (M^\perp)^\perp$. If, also,
$x \in (M^\perp)^\perp$, then $x - x_1 \in (M^\perp)^\perp \cap M^\perp = \{0\}$; and thus $x = x_1 \in M$. It
follows that $M = (M^\perp)^\perp$.

COROLLARY 1.6.7. If $B$ is a subset of a Hilbert space $\mathcal{H}$,
then $(B^\perp)^\perp$ is the closed subspace $M$ generated by $B$.

Proof. $B^\perp = M^\perp$, so $(B^\perp)^\perp = (M^\perp)^\perp = M$.


THEOREM 1.6.8. Suppose that $\mathcal{H}$ is a Hilbert space and
$y \in \mathcal{H}$. Then the equation $f_y(x) = \langle x, y\rangle$ $(x \in \mathcal{H})$ defines a bounded
linear functional $f_y$ on $\mathcal{H}$, and $\|f_y\| = \|y\|$. If $f$ is a bounded linear
functional on $\mathcal{H}$, then there is a unique element $z$ of $\mathcal{H}$ such that
$f = f_z$.

Proof. Since $|f_y(x)| = |\langle x, y\rangle| \le \|x\|\,\|y\|$, with equality
when $x = y$, it follows that $f_y$ is a bounded linear functional on $\mathcal{H}$,
and $\|f_y\| = \|y\|$. To prove that each bounded linear functional $f$ on
$\mathcal{H}$ has the form $f_z$, for some $z$ in $\mathcal{H}$, it suffices to consider the case
in which $f \ne 0$. The closed subspace $M = f^{-1}(0)$ is not the whole of
$\mathcal{H}$, so $M^\perp \ne \{0\}$ by Theorem 1.6.6. Let $y$ be a unit vector in $M^\perp$.
For each $x$ in $\mathcal{H}$, the vector $f(x)y - f(y)x$ is in $M$, hence is orthogonal
to $y$. Thus

    $0 = \langle f(x)y - f(y)x, y\rangle = f(x) - f(y)\langle x, y\rangle = f(x) - \langle x, z\rangle$,

where $z = \overline{f(y)}\,y$. Hence $f = f_z$.
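
In the finite-dimensional space C^n the representing vector of Theorem 1.6.8 can be read off directly. The sketch below is an illustration only: it obtains z from the values of f on the standard orthonormal basis (using Theorem 1.6.3(ii)) instead of following the kernel argument of the proof, and the functional `f`, the helper `riesz_vector`, and the test vector `x` are hypothetical.

```python
def inner(x, y):
    """<x, y>: linear in x and conjugate-linear in y."""
    return sum(a * b.conjugate() for a, b in zip(x, y))


def riesz_vector(f, n):
    """Return z in C^n with f(x) = <x, z>; coordinate j of z is conj(f(e_j))."""
    basis = [[1 if i == j else 0 for i in range(n)] for j in range(n)]
    return [complex(f(e)).conjugate() for e in basis]


def f(x):
    """A hypothetical bounded linear functional on C^3."""
    return (2 - 1j) * x[0] + 3j * x[1] - x[2]


z = riesz_vector(f, 3)
x = [1 + 1j, -2, 0.5j]

representation_error = abs(f(x) - inner(x, z))   # f(x) = <x, z>
norm_gap = abs(f(z) - inner(z, z))               # f attains ||z|| ||z|| at x = z
```

The second quantity reflects the equality case used in the proof: taking x = z gives |f(z)| = ||z||^2, so ||f|| = ||z||.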


In later chapters we shall need the idea of the direct sum of a
family of Hilbert spaces. Suppose that $A$ is a set and, for each $a$ in
$A$, a Hilbert space $\mathcal{H}_a$ is specified. We denote by $\sum \oplus \{\mathcal{H}_a : a \in A\}$
the set of all functions $f$ on $A$ such that

    $f(a) \in \mathcal{H}_a$  $(a \in A)$,    $\sum_{a \in A} \|f(a)\|^2 < \infty$.

It is not difficult to verify that $\sum \oplus \{\mathcal{H}_a : a \in A\}$ becomes a Hilbert
space when the algebraic operations and inner product are defined
by

    $(\alpha f + \beta g)(a) = \alpha f(a) + \beta g(a)$,
    $\langle f, g\rangle = \sum_{a \in A} \langle f(a), g(a)\rangle$.

We call this space the Hilbert direct sum of the family $\{\mathcal{H}_a : a \in A\}$
(when each $\mathcal{H}_a$ is the complex field, it coincides with $l_2(A)$). The
Hilbert direct sum of a finite family $\{\mathcal{H}_j : j = 1, \ldots, n\}$ is sometimes
denoted by

    $\mathcal{H}_1 \oplus \mathcal{H}_2 \oplus \ldots \oplus \mathcal{H}_n$.

Given $b$ in $A$ and $x$ in $\mathcal{H}_b$, let $U_b x$ denote the element $f$ of
$\sum \oplus \{\mathcal{H}_a : a \in A\}$ defined by $f(b) = x$ and $f(a) = 0$ when $a \in A$, $a \ne b$.
It is clear that $U_b$ is an isometric isomorphism from $\mathcal{H}_b$ onto a
closed subspace of $\sum \oplus \{\mathcal{H}_a : a \in A\}$. We frequently identify $\mathcal{H}_b$
with its image $U_b(\mathcal{H}_b)$ under this canonical mapping $U_b$.

If $\{\mathcal{H}_a : a \in A\}$ is a family of pairwise orthogonal closed
subspaces of a Hilbert space $\mathcal{H}$, the sum

    $Uf = \sum_{a \in A} f(a)$

exists, whenever $f \in \sum \oplus \{\mathcal{H}_a : a \in A\}$, by Lemma 1.6.1, and

    $\|Uf\|^2 = \sum_{a \in A} \|f(a)\|^2 = \|f\|^2$.

It follows that $U$ is an isometric isomorphism from $\sum \oplus \{\mathcal{H}_a : a \in A\}$
onto a closed subspace $M$ of $\mathcal{H}$, and it is easily verified that $M$ is
the smallest closed subspace which contains each $\mathcal{H}_a$. We
frequently identify $\sum \oplus \{\mathcal{H}_a : a \in A\}$ with $M$ by means of the
canonical mapping $U$.


We conclude this section with another example of a Hilbert
space. Suppose that $(E, \mathcal{S}, \mu)$ is a measure space, and that $\mathcal{H}$ is the
vector space $L_2(E, \mu)$ of all (equivalence classes of) complex-valued
$\mu$-measurable functions $f$ on $E$ for which

    $\int_E |f|^2 \, d\mu < \infty$.

The equation

    $\langle f, g\rangle = \int_E f \overline{g} \, d\mu$

defines an inner product on $\mathcal{H}$, and $\mathcal{H}$ is complete with respect to
the associated norm,

    $\|f\| = \langle f, f\rangle^{1/2} = [\int_E |f|^2 \, d\mu]^{1/2}$.

Thus, $L_2(E, \mu)$ is a Hilbert space. For the proofs of these
statements, we refer to [25: pp. 174-77].

1.7. Bounded linear operators on Hilbert space

Suppose that $\mathcal{H}$ is a Hilbert space and $p$ is a complex-valued
function on the Cartesian product $\mathcal{H} \times \mathcal{H}$. We say that $p$ is a
sesqui-linear form on $\mathcal{H}$ if

    $p(\alpha x + \beta y, z) = \alpha p(x, z) + \beta p(y, z)$,
    $p(x, \alpha y + \beta z) = \overline{\alpha} p(x, y) + \overline{\beta} p(x, z)$,

whenever $x, y, z \in \mathcal{H}$ and $\alpha, \beta$ are complex numbers. A sesqui-linear
form $p$ on $\mathcal{H}$ is bounded if there is a real number $M$ such that

    $|p(x, y)| \le M \|x\|\,\|y\|$  $(x, y \in \mathcal{H})$;

when this is so, the least such $M$ is denoted by $\|p\|$.


THEOREM 1.7.1. If $T \in \mathcal{B}(\mathcal{H})$ and

(1)  $p_T(x, y) = \langle Tx, y\rangle$  $(x, y \in \mathcal{H})$,

then $p_T$ is a bounded sesqui-linear form on $\mathcal{H}$, and $\|p_T\| = \|T\|$. If
$p$ is any bounded sesqui-linear form on $\mathcal{H}$, there is a unique element
$S$ of $\mathcal{B}(\mathcal{H})$ such that $p = p_S$.

Proof. Clearly $p_T$ is a bounded sesqui-linear form, and
$\|p_T\| \le \|T\|$, for each $T$ in $\mathcal{B}(\mathcal{H})$. Furthermore
$\|Tx\|^2 = \langle Tx, Tx\rangle = p_T(x, Tx) \le \|p_T\|\,\|x\|\,\|Tx\|$, and thus
$\|Tx\| \le \|p_T\|\,\|x\|$, for each $x$ in $\mathcal{H}$; so $\|p_T\| = \|T\|$.

If $p$ is a bounded sesqui-linear form on $\mathcal{H}$, and $x \in \mathcal{H}$, the
mapping $y \to \overline{p(x, y)}$ is a bounded linear functional on $\mathcal{H}$, with norm
not exceeding $\|p\|\,\|x\|$. By Theorem 1.6.8 there is a unique
element $Tx$ of $\mathcal{H}$ such that $\|Tx\| \le \|p\|\,\|x\|$ and $\overline{p(x, y)} = \langle y, Tx\rangle$,
whence $p(x, y) = \langle Tx, y\rangle$. Since $p$ is linear in the first variable,
the mapping $x \to Tx$ is linear. Thus $T \in \mathcal{B}(\mathcal{H})$ and $p = p_T$.

If $T \in \mathcal{B}(\mathcal{H})$ and $u, v \in \mathcal{H}$, a straightforward calculation shows
that

(2)  $4\langle Tu, v\rangle = \langle T(u+v), u+v\rangle - \langle T(u-v), u-v\rangle
          + i\langle T(u+iv), u+iv\rangle - i\langle T(u-iv), u-iv\rangle$.

LEMMA 1.7.2. If $T \in \mathcal{B}(\mathcal{H})$ and $\langle Tx, x\rangle = 0$ for each $x$ in $\mathcal{H}$,
then $T = 0$.

Proof. By (2), $p_T(u, v) = \langle Tu, v\rangle = 0$ for each $u$ and $v$ in $\mathcal{H}$.
It follows from Theorem 1.7.1 that $\|T\| = \|p_T\| = 0$.
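
The identity (2) is easy to test numerically. The following Python sketch, added purely as an illustration, evaluates both sides for an operator on C^2 given by a matrix; the matrix `T`, the vectors `u` and `v`, and the helper names are assumptions made for the example.

```python
def inner(x, y):
    """<x, y>: linear in x and conjugate-linear in y."""
    return sum(a * b.conjugate() for a, b in zip(x, y))


def apply(T, x):
    """Matrix-vector product Tx for T given as a list of rows."""
    return [sum(t * xj for t, xj in zip(row, x)) for row in T]


def quad(T, x):
    """The quadratic form <Tx, x>."""
    return inner(apply(T, x), x)


def polarized(T, u, v):
    """Right-hand side of (2) divided by 4, built from quadratic forms only."""
    total = 0
    for w in (1, 1j, -1, -1j):
        z = [ui + w * vi for ui, vi in zip(u, v)]
        total += w * quad(T, z)        # w <T(u + w v), u + w v>
    return total / 4


# Hypothetical data: a non-self-adjoint matrix and two vectors in C^2.
T = [[1 + 1j, 2], [0, -1j]]
u = [1, 1j]
v = [2 - 1j, 3]

lhs = inner(apply(T, u), v)   # <Tu, v>
rhs = polarized(T, u, v)
```

Since `rhs` uses only values of the quadratic form, this is the mechanism behind Lemma 1.7.2: if `quad(T, x)` vanishes for every `x`, then every `inner(apply(T, u), v)` vanishes as well.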


If $T \in \mathcal{B}(\mathcal{H})$, the equation $p(x, y) = \langle x, Ty\rangle$ defines a bounded
sesqui-linear form $p$ on $\mathcal{H}$. By Theorem 1.7.1, there is a unique
element $T^*$ of $\mathcal{B}(\mathcal{H})$ such that $p = p_{T^*}$; that is,

(3)  $\langle x, Ty\rangle = \langle T^*x, y\rangle$  $(x, y \in \mathcal{H})$.

The operator $T^*$ is called the adjoint of $T$. From (3) and Theorem
1.7.1, it is easily verified that $\|T\| = \|T^*\|$, and that

    $(A+B)^* = A^* + B^*$,    $(\alpha A)^* = \overline{\alpha} A^*$,
    $(AB)^* = B^*A^*$,        $(A^*)^* = A$.

It follows that $A^*$ has an inverse if and only if $A$ has an inverse,
and then $(A^*)^{-1} = (A^{-1})^*$. By applying this with $\lambda I - T$ in place of
$A$, we conclude that

(4)  $\sigma(T^*) = \{\overline{\lambda} : \lambda \in \sigma(T)\}$.

Since $\|Tx\|^2 = \langle Tx, Tx\rangle = \langle T^*Tx, x\rangle \le \|T^*T\|\,\|x\|^2$, we have
$\|T\|^2 \le \|T^*T\| \le \|T^*\|\,\|T\| = \|T\|^2$, so

(5)  $\|T^*T\| = \|T\|^2$.

With each $T$ in $\mathcal{B}(\mathcal{H})$ we associate two closed subspaces of $\mathcal{H}$,
the null space $\mathcal{N}_T = \{x \in \mathcal{H} : Tx = 0\}$ of $T$, and the closed range
space $\mathcal{R}_T$ of $T$, which is the closure of the set $\{Tx : x \in \mathcal{H}\}$. From
(3), $x \in \mathcal{R}_T^\perp$ if and only if $T^*x = 0$, so

(6)  $\mathcal{R}_T^\perp = \mathcal{N}_{T^*}$,    $\mathcal{R}_T = \mathcal{N}_{T^*}^\perp$.

Clearly $\mathcal{N}_T \subseteq \mathcal{N}_{T^*T}$, and the reverse inclusion holds since
$\|Tx\|^2 = \langle T^*Tx, x\rangle$. Thus

(7)  $\mathcal{N}_{T^*T} = \mathcal{N}_T$,    $\mathcal{R}_{T^*T} = \mathcal{N}_{T^*T}^\perp = \mathcal{N}_T^\perp = \mathcal{R}_{T^*}$.

An element $A$ of $\mathcal{B}(\mathcal{H})$ is said to be self-adjoint if $A = A^*$. Each
$T$ in $\mathcal{B}(\mathcal{H})$ can be expressed (uniquely) as $A + iB$, with $A$ and $B$
self-adjoint. The operator $A$ $(= \frac{1}{2}(T + T^*))$ is called the self-adjoint
part (or real part) of $T$, and is denoted by Re $T$; while
$B$ $(= \frac{1}{2}i(T^* - T))$ is called the skew-adjoint part (or imaginary part),
Im $T$. We say that $T$ is normal if $T^*T = TT^*$; this occurs if and
only if $AB = BA$.


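LEMMA 1.7.3. Suppose that $T \in \mathcal{B}(\mathcal{H})$.

(i) $T$ is self-adjoint if and only if $\langle Tx, x\rangle$ is real for each $x$
in $\mathcal{H}$.

(ii) $T$ is normal if and only if $\|Tx\| = \|T^*x\|$ for each $x$ in $\mathcal{H}$.

Proof. Since

    $\langle (T - T^*)x, x\rangle = \langle Tx, x\rangle - \langle x, Tx\rangle = 2i\,\mathrm{Im}\,\langle Tx, x\rangle$,
    $\langle (T^*T - TT^*)x, x\rangle = \|Tx\|^2 - \|T^*x\|^2$

for each $x$ in $\mathcal{H}$, the stated results follow at once from Lemma 1.7.2.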


THEOREM 1.7.4. If $T$ is a normal element of $\mathcal{B}(\mathcal{H})$, then
$r(T) = \|T\|$.

Proof. By Lemma 1.7.3, $\|T^2x\| = \|T^*Tx\|$ for each $x$ in $\mathcal{H}$.
This, together with (5), implies that

    $\|T^2\| = \|T^*T\| = \|T\|^2$.

Since $T^q$ is normal, $\|T^{2q}\| = \|T^q\|^2$ $(q = 1, 2, \ldots)$. Hence

    $\|T^k\| = \|T\|^k$

when $k$ has the form $2^n$ $(n = 1, 2, \ldots)$. It follows that

    $r(T) = \lim \|T^k\|^{1/k} = \|T\|$.

COROLLARY 1.7.5. If $T \in \mathcal{B}(\mathcal{H})$ and $T$ is both normal and
quasi-nilpotent, then $T = 0$.
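
The normality hypothesis in Theorem 1.7.4 and Corollary 1.7.5 cannot be dropped. The following worked example is an illustration added here (it assumes the two-dimensional Hilbert space C^2 with its usual inner product): it exhibits a non-zero quasi-nilpotent operator, which is necessarily not normal.

```latex
\[
T = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \qquad
T^2 = 0, \qquad
r(T) = \lim_{k \to \infty} \|T^k\|^{1/k} = 0, \qquad
\|T\| = 1,
\]
\[
T^*T = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}
\;\ne\;
\begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix} = TT^* .
\]
```

Here r(T) is strictly smaller than ||T||, so the equality of Theorem 1.7.4 fails once T*T and TT* differ.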


THEOREM 1.7.6. If $T$ is a self-adjoint element of $\mathcal{B}(\mathcal{H})$, then
$\sigma(T)$ is a subset of the real line.

Proof. We have to show that $T - \lambda I$ has a bounded inverse if
$\lambda = \mu + i\nu$ and $\nu \ne 0$. If $x \in \mathcal{H}$, then $\langle Tx, x\rangle$ is real, so

    $\|Tx - (\mu + i\nu)x\|\,\|x\| \ge |\langle Tx - (\mu + i\nu)x, x\rangle|
        = |\langle Tx, x\rangle - (\mu + i\nu)\|x\|^2| \ge |\nu|\,\|x\|^2$,

(8)  $\|Tx - (\mu + i\nu)x\| \ge |\nu|\,\|x\|$  $(x \in \mathcal{H})$.

In particular, both $T - \lambda I$ and its adjoint $T - \overline{\lambda} I$ have null space $\{0\}$.
By (6), the closed range space of $T - \lambda I$ is the whole of $\mathcal{H}$. Thus
each $y$ in $\mathcal{H}$ is the limit of a sequence $((T - \lambda I)x_n)$, with $x_1, x_2, \ldots$
in $\mathcal{H}$. By (8),

    $\|x_m - x_n\| \le |\nu|^{-1} \|(T - \lambda I)(x_m - x_n)\| \to 0$

as $\min(m, n) \to \infty$. Thus $(x_n)$ converges to some $x$ in $\mathcal{H}$, and
$y = \lim (T - \lambda I)x_n = (T - \lambda I)x$. This shows that $T - \lambda I$ maps $\mathcal{H}$ onto the
whole of $\mathcal{H}$; by (8), it is one to one, and its inverse $R$ is bounded,
with $\|R\| \le |\nu|^{-1}$.

If $T = T^* \in \mathcal{B}(\mathcal{H})$, the spectrum $\sigma(T)$ is a compact subset of the
real line. The set $C(\sigma(T))$ of all complex-valued continuous
functions on $\sigma(T)$ is a Banach space with respect to the uniform
norm,

    $\|f\| = \sup \{|f(\lambda)| : \lambda \in \sigma(T)\}$.

When $f, g \in C(\sigma(T))$, the product $fg$ and the complex conjugate
function $\overline{f}$ (both defined pointwise) are also in $C(\sigma(T))$. By the
classical Weierstrass approximation theorem, the set $P$ of
all polynomials (considered as functions on $\sigma(T)$) is an everywhere
dense subspace of $C(\sigma(T))$.


THEOREM 1.7.7. If $T = T^* \in \mathcal{B}(\mathcal{H})$, there is a unique linear
mapping $f \to f(T)$ from $C(\sigma(T))$ into $\mathcal{B}(\mathcal{H})$ such that

(i) $f(T)$ has its elementary meaning when $f$ is a polynomial;
(ii) $\|f(T)\| = \|f\|$  $(f \in C(\sigma(T)))$.

Furthermore, for each $f$ and $g$ in $C(\sigma(T))$,

(iii) $(fg)(T) = f(T)g(T)$;
(iv) $\overline{f}(T) = f(T)^*$;
(v) $f(T)$ is normal;
(vi) $f(T)S = Sf(T)$ whenever $S \in \mathcal{B}(\mathcal{H})$ and $TS = ST$;
(vii) if $\lambda \in \sigma(T)$, $x \in \mathcal{H}$ and $Tx = \lambda x$, then $f(T)x = f(\lambda)x$.

Proof. If $p(\lambda) = a_0 + a_1\lambda + \ldots + a_n\lambda^n$, then
$\overline{p}(\lambda) = \overline{a_0} + \overline{a_1}\lambda + \ldots + \overline{a_n}\lambda^n$ when $\lambda \in \sigma(T)$. Thus

    $p(T)^* = (a_0 I + a_1 T + \ldots + a_n T^n)^*
            = \overline{a_0} I + \overline{a_1} T + \ldots + \overline{a_n} T^n = \overline{p}(T)$.

It follows that $p(T)$ is normal. By Theorems 1.5.1, 1.5.2 and
1.7.4,

    $\|p(T)\| = r(p(T))
             = \sup \{|\mu| : \mu \in \sigma(p(T))\}
             = \sup \{|p(\lambda)| : \lambda \in \sigma(T)\} = \|p\|$,

where $\|p\|$ denotes the norm of $p$ as an element of $C(\sigma(T))$. This
shows that the linear mapping $p \to p(T)$ from $P$ into $\mathcal{B}(\mathcal{H})$ is
isometric. Since $P$ is dense in $C(\sigma(T))$ and $\mathcal{B}(\mathcal{H})$ is complete, this
mapping extends uniquely, by continuity, to an isometric linear
mapping $f \to f(T)$ from $C(\sigma(T))$ into $\mathcal{B}(\mathcal{H})$.

We have now proved the existence and uniqueness of a linear
mapping $f \to f(T)$ satisfying parts (i) and (ii) of the theorem. It is
evident that parts (iii), ..., (vii) are satisfied when $f$ and $g$ are
polynomials; by continuity, they remain valid whenever
$f, g \in C(\sigma(T))$. This completes the proof of the theorem.

The mapping $f \to f(T)$ described in Theorem 1.7.7 is called the
functional calculus for the self-adjoint operator $T$. A further
property of this mapping is proved in Theorem 1.7.9.


Suppose that $S$ and $T$ are self-adjoint elements of $\mathcal{B}(\mathcal{H})$. We
write $S \le T$ if $\langle Sx, x\rangle \le \langle Tx, x\rangle$ for each $x$ in $\mathcal{H}$. If $S \le T$, and
$S \ne T$, we write $S < T$. It follows from Lemma 1.7.2 that $\le$ is a
partial order relation. It is easily verified that, if $S \le T$, then
$A^*SA \le A^*TA$ for each $A$ in $\mathcal{B}(\mathcal{H})$. Furthermore, $S \le \|S\| I$ for every
self-adjoint $S$ in $\mathcal{B}(\mathcal{H})$.

If $T \in \mathcal{B}(\mathcal{H})$ and $\langle Tx, x\rangle$ is a non-negative real number
whenever $x \in \mathcal{H}$, it follows from Lemma 1.7.3 that $T$ is self-adjoint,
and so $T \ge 0$; in these circumstances, we call $T$ a positive operator.
If $S$ and $T$ are positive operators and $a$ is a positive real number,
then $S + T \ge 0$, $aS \ge 0$.


THEOREM 1.7.8. If $T$ is a self-adjoint element of $\mathcal{B}(\mathcal{H})$, then
the following conditions are equivalent.

(i) $\sigma(T)$ consists of non-negative real numbers.
(ii) $T = H^2$ for some positive operator $H$.
(iii) $T = A^*A$ for some $A$ in $\mathcal{B}(\mathcal{H})$.
(iv) $T \ge 0$.

When these conditions are satisfied, the operator $H$ occurring in (ii)
is unique.


Proof. It is clear that (ii) implies (iii). If $T = A^*A$, then
$\langle Tx, x\rangle = \langle Ax, Ax\rangle \ge 0$, for each $x$ in $\mathcal{H}$; so (iii) implies (iv).

If (i) is satisfied, the real-valued function $g(\lambda) = \lambda^{1/4}$ is
continuous on $\sigma(T)$, and $(g(\lambda))^4 = \lambda$. Hence the operator $S = g(T)$ is
self-adjoint, and $S^4 = T$. Since (iii) implies (iv), $H = S^2$ is a
positive operator and $H^2 = T$. Thus (i) implies (ii).

Suppose $T \ge 0$, and define $h$ in $C(\sigma(T))$ by

(9)  $h(\lambda) = 0$ when $\lambda \ge 0$,    $h(\lambda) = (-\lambda)^{1/2}$ when $\lambda < 0$.

Since $h$ is real-valued and $h(\lambda)\lambda h(\lambda) = -[h(\lambda)]^4$ when $\lambda \in \sigma(T)$, the
operator $S = h(T)$ is self-adjoint and $STS = -S^4$. Since (iii) implies
(iv) we have $S^4 \ge 0$, and so

    $0 \ge -S^4 = STS = S^*TS \ge 0$.

Thus $0 = S^4 = (h(T))^4$ and, since the mapping $f \to f(T)$ is one to one,
it follows that $h^4$ is the zero element of $C(\sigma(T))$. This, together
with (9), shows that $\sigma(T)$ consists of non-negative real numbers.
Thus (iv) implies (i), and the equivalence of the four conditions in
the theorem is proved.

It remains to show that, if $T \ge 0$, the operator $H$ in (ii) is
unique. Now $H = f(T)$, where $f$ is the element of $C(\sigma(T))$ defined by
$f(\lambda) = \lambda^{1/2}$. If $(p_n)$ is a sequence of polynomials converging
uniformly to $f$ on $\sigma(T)$, then

(10)  $\|H - p_n(T)\| \to 0$ as $n \to \infty$.

Suppose that $K$ is any positive operator such that $K^2 = T$. Since
$\sigma(K)$ consists of non-negative real numbers, and $\sigma(T) = \{\mu^2 : \mu \in \sigma(K)\}$,
the polynomial $q_n(\mu) = p_n(\mu^2)$ converges, uniformly on $\sigma(K)$, to the
function $f(\mu^2) = \mu = u(\mu)$, where $u$ is the identity mapping on $\sigma(K)$.
Thus

(11)  $\|K - p_n(T)\| = \|K - q_n(K)\| = \|u - q_n\| \to 0$ as $n \to \infty$.

By (10) and (11), $H = K$. This shows that $H$ is unique, and so
completes the proof of the theorem.


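When $T \ge 0$, the operator $H$ in Theorem 1.7.8(ii) is called the
positive square root of $T$, and is denoted by $T^{1/2}$. Note that, if
$T \ge 0$, $x \in \mathcal{H}$ and $\langle Tx, x\rangle = 0$, then $Tx = 0$; for

    $\|T^{1/2}x\|^2 = \langle T^{1/2}x, T^{1/2}x\rangle = \langle Tx, x\rangle = 0$,

and so $Tx = T^{1/2}T^{1/2}x = 0$.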
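
For a positive operator on C^2 the positive square root can be written down in closed form, which gives a quick way to experiment with Theorem 1.7.8. The sketch below is an illustration only and does not follow the functional-calculus construction of the proof: it uses the Cayley-Hamilton identity H^2 - (tr H)H + (det H)I = 0, which for H = T^{1/2} gives H = (T + sI)/t with s = (det T)^{1/2} and t = (tr T + 2s)^{1/2}. The function names and the sample matrix `A` are assumptions, and the formula requires T to be non-zero.

```python
import math


def matmul(A, B):
    """Product of two 2x2 matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def adjoint(A):
    """Conjugate transpose of a 2x2 matrix."""
    return [[A[j][i].conjugate() for j in range(2)] for i in range(2)]


def positive_sqrt_2x2(T):
    """Positive square root of a non-zero positive 2x2 matrix T."""
    det = (T[0][0] * T[1][1] - T[0][1] * T[1][0]).real
    tr = (T[0][0] + T[1][1]).real
    s = math.sqrt(max(det, 0.0))    # s = det H
    t = math.sqrt(tr + 2 * s)       # t = tr H, positive because T != 0
    return [[(T[i][j] + (s if i == j else 0)) / t for j in range(2)]
            for i in range(2)]


# Hypothetical example: T = A*A is positive by Theorem 1.7.8(iii).
A = [[1, 1j], [0, 2]]
T = matmul(adjoint(A), A)
H = positive_sqrt_2x2(T)
H_squared = matmul(H, H)
```

For this `A` the result `H` is self-adjoint with positive trace and determinant, so it is the operator T^{1/2} of the theorem; uniqueness means that no other positive matrix squares to `T`.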

THEOREM 1.7.9. Suppose that $T$ is a self-adjoint element of
$\mathcal{B}(\mathcal{H})$ and $f \in C(\sigma(T))$. Then $f(T) \ge 0$ if and only if $f$ takes
non-negative real values throughout $\sigma(T)$.

Proof. If $f(\lambda) \ge 0$ $(\lambda \in \sigma(T))$, then there is a real-valued
function $g$ in $C(\sigma(T))$ satisfying $g^2 = f$. The operator $A = g(T)$ is
self-adjoint and $f(T) = A^2$; so $f(T) \ge 0$.

Conversely, suppose that $f(T) \ge 0$. Since $f(T)$ is self-adjoint,
$f$ is real-valued on $\sigma(T)$; and so, $f = u - v$, where

    $u(\lambda) = \max \{0, f(\lambda)\}$,    $v(\lambda) = \max \{0, -f(\lambda)\}$.

Since $v^3$ is a non-negative function, in $C(\sigma(T))$, it follows from
the first part of this proof that $[v(T)]^3 \ge 0$. However,
$vfv = vuv - v^3 = -v^3$, so

    $0 \le v(T)^* f(T) v(T) = -[v(T)]^3 \le 0$.

Thus $[v(T)]^3 = 0$; it follows that $v^3$, and hence also $v$, is the zero
element of $C(\sigma(T))$. Thus $f(\lambda) = u(\lambda) \ge 0$ for each $\lambda$ in $\sigma(T)$.


We now describe the simplest class of self-adjoint operators. If
$M$ is a closed subspace of a Hilbert space $\mathcal{H}$, then
$M \cap M^\perp = \{0\}$, $M + M^\perp = \mathcal{H}$. By elementary linear algebra (see, for
example, [26: pp. 73, 74]) there is a unique linear operator $E$ on $\mathcal{H}$
such that $Ex \in M$ and $x - Ex \in M^\perp$ for each $x$ in $\mathcal{H}$. Furthermore,
$E^2 = E$ and

(12)  $M = \{x \in \mathcal{H} : Ex = x\} = \{Ex : x \in \mathcal{H}\}$,

(13)  $M^\perp = \{x \in \mathcal{H} : Ex = 0\}$.

Since $Ex$ and $x - Ex$ are orthogonal, with sum $x$, we have

    $\|x\|^2 = \|Ex\|^2 + \|x - Ex\|^2 \ge \|Ex\|^2$  $(x \in \mathcal{H})$.

Thus $E$ is a bounded linear operator, with $\|E\| \le 1$. Furthermore,
$\langle Ex, x - Ex\rangle = 0$, and therefore

    $\langle Ex, x\rangle = \langle Ex, Ex\rangle = \|Ex\|^2 \ge 0$  $(x \in \mathcal{H})$.

It follows that $E$ is a positive self-adjoint operator. We call $E$ the
projection from $\mathcal{H}$ onto $M$ (it is sometimes described as an
orthogonal projection, since its range space $M$ is orthogonal to its
null space $M^\perp$).

Conversely, if $E \in \mathcal{B}(\mathcal{H})$ and $E^2 = E = E^*$, then (12) defines a
closed subspace $M$ of $\mathcal{H}$. Since $M = \mathcal{R}_E$, it follows from (6) that
$M^\perp = \mathcal{N}_{E^*} = \mathcal{N}_E$; so (13) is satisfied. If $x \in \mathcal{H}$ then clearly $Ex \in M$;
moreover, $x - Ex \in M^\perp$ by (13), since $E(x - Ex) = Ex - E^2x = 0$. Hence
$E$ is the projection from $\mathcal{H}$ onto $M$. It follows that projections are
characterized by the conditions $E^2 = E = E^*$.

Suppose that $E$ and $F$ are the projections from $\mathcal{H}$ onto closed
subspaces $M$ and $N$, respectively. It is readily verified that $M$ is
orthogonal to $N$ if and only if $EF = 0$ (or equivalently, since
$FE = (EF)^*$, $FE = 0$). If $FE = EF$, then each of the operators
$EF$, $E + F - EF$ satisfies the equations $P^2 = P^* = P$, and is therefore
a projection; a straightforward argument shows that the corresponding
closed subspaces of $\mathcal{H}$ are $M \cap N$ and $M + N$, respectively.

Elementary linear algebra (see, for example, [26: p. 148])
suffices to prove the following result.

LEMMA 1.7.10. If $E$ and $F$ are the projections from a Hilbert
space $\mathcal{H}$ onto closed subspaces $M$ and $N$, respectively, then the
following conditions are equivalent.

(i) $M \subseteq N$.
(ii) $FE = E$.
(iii) $EF = E$.
(iv) $\|Ex\| \le \|Fx\|$  $(x \in \mathcal{H})$.
(v) $E \le F$.

When these conditions are satisfied, $F - E$ is the projection from $\mathcal{H}$
onto $N \cap M^\perp$.

If $\{M_a : a \in A\}$ is a family of closed subspaces of $\mathcal{H}$, there
is a smallest closed subspace $\vee M_a$ which contains each
$M_a$, and a largest closed subspace $\wedge M_a$ $(= \cap M_a)$ contained in each $M_a$.
Lemma 1.7.10 shows that the set of closed subspaces of $\mathcal{H}$
(partially ordered by $\subseteq$) is order isomorphic to the set $\mathcal{P}$ of projections
(partially ordered by $\le$). Thus each family $(E_a)$ of projections has a
least upper bound $\vee E_a$ and a greatest lower bound $\wedge E_a$ in $\mathcal{P}$. We
use the notations $M \vee N$, $M \wedge N$, $E \vee F$, $E \wedge F$ when only two subspaces
or projections are under consideration. A family $\{E_a : a \in A\}$ of
projections is said to be directed upward if, given any $\beta$ and $\gamma$ in $A$,
there is a $\delta$ in $A$ such that $E_\delta \ge E_\beta$, $E_\delta \ge E_\gamma$. The definition of
directed downward is similar, but $\ge$ is replaced by $\le$.

The strong topology on $\mathcal{B}(\mathcal{H})$ is the coarsest topology such that,
for each $x$ in $\mathcal{H}$, the mapping $T \to Tx$ from $\mathcal{B}(\mathcal{H})$ into $\mathcal{H}$ is continuous.
If $E_0 \in \mathcal{B}(\mathcal{H})$, then sets of the form

(14)  $\{E \in \mathcal{B}(\mathcal{H}) : \|Ex_j - E_0x_j\| < \epsilon \; (j = 1, \ldots, n)\}$

(where $x_1, \ldots, x_n \in \mathcal{H}$ and $\epsilon > 0$) form a base of neighbourhoods of
$E_0$ in the strong topology. In the following theorem, 'strong
closure' is used as an abbreviation for 'closure in the strong
topology'. We shall adopt similar conventions when using other
topological terms in relation to the strong topology.

THEOREM 1.7.11. Suppose that $\{E_a : a \in A\}$ is a family of
projections acting on a Hilbert space $\mathcal{H}$, and is directed upward
(respectively, downward). Then $\vee E_a$ (respectively, $\wedge E_a$) lies in
the strong closure of $\{E_a : a \in A\}$.

Proof. We consider first the case in which $\{E_a : a \in A\}$ is
directed upward. Suppose that $E_a$ is the projection from $\mathcal{H}$ onto the
closed subspace $M_a$, so that $E_0 = \vee E_a$ is the projection onto the
closed subspace $M_0$ generated by $\cup M_a$. In fact, $\cup M_a$ is already a
subspace, and so $M_0$ is its closure: for, if $x, y \in \cup M_a$, we may
choose $\beta$ and $\gamma$, and then $\delta$, such that $x \in M_\beta$, $y \in M_\gamma$,
$M_\beta \subseteq M_\delta$, $M_\gamma \subseteq M_\delta$; it follows that $x, y \in M_\delta$, and so
$z \in M_\delta \subseteq \cup M_a$, whenever $z$ is a linear combination of $x$ and $y$.

Every strong neighbourhood of $E_0$ contains a set of the form
(14), so it suffices to show that this set contains an $E_\beta$. Since
$E_0x_j$ lies in the closure $M_0$ of $\cup M_a$, there exist $a(j)$ in $A$ and $y_j$ in
$M_{a(j)}$ such that $\|E_0x_j - y_j\| < \epsilon$. There is an element $\beta$ of $A$ such
that $M_{a(j)} \subseteq M_\beta$ $(j = 1, \ldots, n)$, and so $y_1, \ldots, y_n \in M_\beta$. Thus

    $\|E_0x_j - E_\beta x_j\| = \|(E_0 - E_\beta)(E_0x_j - y_j)\|
        \le \|E_0x_j - y_j\| < \epsilon$  $(j = 1, \ldots, n)$.

Hence $E_\beta$ lies in the set (14). This proves the theorem in the case
where $\{E_a : a \in A\}$ is directed upward.

Now suppose that $\{E_a : a \in A\}$ is directed downward, and let
$E_0 = \wedge E_a$. Since two projections $P$, $Q$ satisfy $P \le Q$ if and only if
$I - P \ge I - Q$, it follows that $\{I - E_a : a \in A\}$ is directed upward and
$\vee (I - E_a) = I - E_0$. It follows from the first part of the proof that
$I - E_0$ lies in the strong closure of $\{I - E_a : a \in A\}$. If $x_1, \ldots, x_n \in \mathcal{H}$
and $\epsilon > 0$, there exists $\beta$ in $A$ such that

    $\|(I - E_\beta)x_j - (I - E_0)x_j\| < \epsilon$  $(j = 1, \ldots, n)$.

Thus $E_\beta$ lies in the basic neighbourhood (14) of $E_0$; and so $E_0$ is in
the strong closure of $\{E_a : a \in A\}$.
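
The strong closure in Theorem 1.7.11 cannot in general be replaced by the norm closure. The following worked example is an illustration added here, under the assumption that $\mathcal{H} = l_2(N)$ with its standard orthonormal basis $(e_n)$ and that $E_n$ denotes the projection onto the span of $e_1, \ldots, e_n$; the family $\{E_n\}$ is then directed upward.

```latex
\[
E_1 \le E_2 \le E_3 \le \cdots, \qquad \bigvee_n E_n = I ,
\]
\[
\|x - E_n x\|^2 = \sum_{k > n} |x(k)|^2 \longrightarrow 0
\quad (n \to \infty) \qquad \text{for each } x \in l_2(N),
\]
\[
\|I - E_n\| \ge \|(I - E_n)e_{n+1}\| = \|e_{n+1}\| = 1
\qquad (n = 1, 2, \ldots).
\]
```

So $I$ lies in the strong closure of $\{E_n\}$, as the theorem asserts, although no $E_n$ is within norm distance less than 1 of $I$.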

Suppose that $\mathcal{H}$ is a Hilbert space and $V \in \mathcal{B}(\mathcal{H})$. We say that $V$
is a partial isometry if there is a closed subspace $M$ of $\mathcal{H}$ such that

(15)  $\|Vx\| = \|x\|$  $(x \in M)$,    $Vx = 0$  $(x \in M^\perp)$.

When this condition is satisfied, the restriction of $V$ to $M$ is an
isometric isomorphism from $M$ onto the subspace $N = \{Vx : x \in M\}$ of
$\mathcal{H}$. Since $M$ is a closed subspace of $\mathcal{H}$, it is complete; thus $N$ is
complete, and is therefore closed in $\mathcal{H}$. Since $M + M^\perp = \mathcal{H}$, it follows
from (15) that

(16)  $N = \{Vx : x \in M\} = \{Vx : x \in \mathcal{H}\}$.

We call $M$ the initial space and $N$ the final space of $V$; the
corresponding projections $E$ and $F$ are called the initial and final
projections of $V$. Clearly

(17)  $V = FVE$.

If $x \in \mathcal{H}$, then $Ex \in M$ and, by (15),

    $\langle V^*Vx, x\rangle = \langle Vx, Vx\rangle = \langle VEx, VEx\rangle
        = \langle Ex, Ex\rangle = \langle Ex, x\rangle$.

By Lemma 1.7.2,

(18)  $V^*V = E$.

Given $x$ in $\mathcal{H}$, it follows from (16) that $Fx = Vy$ for some $y$ in $\mathcal{H}$. By
(17), $V^* = V^*F$ and $V = VE$, so

    $VV^*x = VV^*Fx = VV^*Vy = VEy = Vy = Fx$.

Thus

(19)  $VV^* = F$.

If $x \in N$, then

    $\|V^*x\|^2 = \langle VV^*x, x\rangle = \langle Fx, x\rangle = \|x\|^2$;

if $x \in N^\perp$, then $Fx = 0$ and, by (17), $V^*x = V^*Fx = 0$. Thus $V^*$ is a
partial isometry, with initial projection $F$; by (18) and (19), its final
projection is $E$.

By a unitary operator on a Hilbert space $\mathcal{H}$ we mean an invertible
element $U$ of $\mathcal{B}(\mathcal{H})$ satisfying $\|Ux\| = \|x\|$ $(x \in \mathcal{H})$. Since $U$ is a
partial isometry with initial projection $I$, it follows that $U^*U = I$.
Hence the inverse $U^{-1}$ is $U^*$, and

(20)  $\langle Ux, Uy\rangle = \langle U^*Ux, y\rangle = \langle x, y\rangle$  $(x, y \in \mathcal{H})$.

Conversely, if $U$ is an invertible element of $\mathcal{B}(\mathcal{H})$ and $U^{-1} = U^*$,
then (20) is satisfied; in particular, $\|Ux\| = \|x\|$ $(x \in \mathcal{H})$, so $U$ is
unitary.


THEOREM 1.7.12. Suppose that $T \in \mathcal{B}(\mathcal{H})$, and let $M$ and $N$ be
the closed range spaces of $T^*$ and $T$, respectively. Then there is a
partial isometry $V$ with initial space $M$ and final space $N$, and a
positive operator $H$, such that $T = VH$, $H = V^*T$. These conditions
determine $V$ and $H$ uniquely.

Proof. Let $H$ be the positive square root of $T^*T$. By two
applications of (7), the operators $H$, $H^2 = T^*T$ and $T^*$ all have the
same closed range space $M$. For each $x$ in $\mathcal{H}$,

    $\|Tx\|^2 = \langle T^*Tx, x\rangle = \langle H^2x, x\rangle = \|Hx\|^2$.

It follows that the mapping $Hx \to Tx$ is well-defined, and is an
isometric isomorphism from a dense subspace of $M$ onto a dense
subspace of $N$. This mapping extends by continuity to an isometric
isomorphism $W$ from $M$ onto $N$, such that $WHx = Tx$ $(x \in \mathcal{H})$.
The linear operator $V$ defined by

    $Vy = Wy$  $(y \in M)$,    $Vz = 0$  $(z \in M^\perp)$

is a partial isometry with initial space $M$ and final space $N$; and
$Tx = VHx$ $(x \in \mathcal{H})$. Since $V^*V$ is the projection onto $M$, the closed
range space of $H$, we have

    $Hx = V^*VHx = V^*Tx$  $(x \in \mathcal{H})$.

This proves the existence of operators $V$ and $H$ with the stated
properties.

Suppose that $V_1$ and $H_1$ also have these properties. Then $V_1V_1^*$
is the projection onto the closed range space $N$ of $T$, and so

    $H_1^2 = H_1^*H_1 = T^*V_1V_1^*T = T^*T$.

Since the positive square root of $T^*T$ is unique, $H_1 = H$. For each
$x$ in $\mathcal{H}$,

    $VHx = Tx = V_1H_1x = V_1Hx$.

Since $V$ and $V_1$ are continuous, it follows that $Vy = V_1y$ for each $y$
in the closed range space $M$ of $H$. Furthermore, $Vz = V_1z = 0$
whenever $z \in M^\perp$. Thus $V = V_1$. This shows that the operators $V$
and $H$ are uniquely determined.

When operators $T$, $V$ and $H$ are related as in Theorem 1.7.12, we
describe $VH$ as the polar decomposition of $T$.
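
For an invertible operator on C^2 the polar decomposition can be computed directly, which makes the statement of Theorem 1.7.12 easy to check by hand or by machine. The Python sketch below is illustrative only: it assumes T is an invertible 2x2 matrix (so that M = N = C^2 and V is unitary), takes H = (T*T)^{1/2} from a closed form valid only for 2x2 matrices, and sets V = T H^{-1}. The function names and the sample matrix `T` are hypothetical.

```python
import math


def matmul(A, B):
    """Product of two 2x2 matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def adjoint(A):
    """Conjugate transpose of a 2x2 matrix."""
    return [[A[j][i].conjugate() for j in range(2)] for i in range(2)]


def positive_sqrt_2x2(P):
    """Positive square root of a non-zero positive 2x2 matrix P."""
    det = (P[0][0] * P[1][1] - P[0][1] * P[1][0]).real
    tr = (P[0][0] + P[1][1]).real
    s = math.sqrt(max(det, 0.0))
    t = math.sqrt(tr + 2 * s)
    return [[(P[i][j] + (s if i == j else 0)) / t for j in range(2)]
            for i in range(2)]


def inverse_2x2(A):
    """Inverse of an invertible 2x2 matrix."""
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return [[A[1][1] / det, -A[0][1] / det],
            [-A[1][0] / det, A[0][0] / det]]


def polar_2x2(T):
    """Return (V, H) with T = V H and H = (T*T)^{1/2}; T must be invertible."""
    H = positive_sqrt_2x2(matmul(adjoint(T), T))
    V = matmul(T, inverse_2x2(H))
    return V, H


T = [[1, 1j], [0, 2]]                 # hypothetical invertible operator on C^2
V, H = polar_2x2(T)

VH = matmul(V, H)                     # should reproduce T
VstarV = matmul(adjoint(V), V)        # initial projection E (here the identity)
VstarT = matmul(adjoint(V), T)        # should reproduce H
```

When T is not invertible, H has no inverse and V must instead be defined on the closed range space of H and extended by zero, exactly as in the proof; the sketch does not cover that case.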


1.8. Compact linear operators 
Suppose that $X$ is a complex Banach space and $T \in \mathcal{B}(X)$. We say
that $T$ is a compact linear operator on $X$ if, given any sequence $(x_n)$
in $X$ such that $\sup\{\|x_n\| : n = 1, 2, \ldots\} < \infty$, there is a subsequence
$(x_{n(q)})$ such that $(Tx_{n(q)})$ converges to an element of $X$. It is readily
verified that the set $\mathcal{C}$ of all compact linear operators on $X$ is an
ideal in $\mathcal{B}(X)$; that is, $RS$, $SR$, $\alpha S + \beta T \in \mathcal{C}$ whenever $\alpha$, $\beta$ are complex
numbers, $R \in \mathcal{B}(X)$ and $S, T \in \mathcal{C}$. Furthermore, $\mathcal{C}$ is closed with
respect to the norm topology of $\mathcal{B}(X)$ [59: p. 285, exercise 3].

An element $T$ of $\mathcal{B}(X)$ is said to have finite rank if the subspace
$M = \{Tx : x \in X\}$ of $X$ is finite-dimensional; the dimension $n$ of $M$ is
then called the rank of $T$. Such an operator $T$ is compact, since
every bounded sequence in the finite-dimensional normed space $M$
has a convergent subsequence [59: Theorem 3.12-D]. It is readily
verified that the operators of finite rank form an ideal in $\mathcal{B}(X)$.

We now turn to the spectral theory of compact linear operators. 
A full account of this subject is given in [59: pp. 268-85]; 
accordingly, a number of the results that follow are stated without 
proof. In the next theorem, $T$ is a compact linear operator acting
on a complex Banach space $X$, $\lambda$ is a non-zero complex number, and
the subspaces $\mathcal{N}_k$ and $\mathcal{R}_k$ of $X$ are defined, for $k = 0, 1, 2, \ldots$, by

$$\mathcal{N}_k = \{x \in X : (\lambda I - T)^k x = 0\}, \tag{1}$$
$$\mathcal{R}_k = \{(\lambda I - T)^k x : x \in X\}. \tag{2}$$

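THEOREM 1.8.1. For $k = 0, 1, 2, \ldots$, the subspaces $\mathcal{N}_k$ and
$\mathcal{R}_k$ are invariant under $T$; $\mathcal{N}_k$ is finite-dimensional and $\mathcal{R}_k$ is closed.
There is an integer $\nu$ ($\ge 0$) such that

(i) $\mathcal{N}_0 \subsetneq \mathcal{N}_1 \subsetneq \cdots \subsetneq \mathcal{N}_\nu = \mathcal{N}_{\nu+1} = \cdots$,
(ii) $\mathcal{R}_0 \supsetneq \mathcal{R}_1 \supsetneq \cdots \supsetneq \mathcal{R}_\nu = \mathcal{R}_{\nu+1} = \cdots$.

Furthermore,

$$\mathcal{N}_\nu \cap \mathcal{R}_\nu = \{0\}, \qquad \mathcal{N}_\nu + \mathcal{R}_\nu = X.$$

The restriction of $\lambda I - T$ to $\mathcal{R}_\nu$ has an inverse in $\mathcal{B}(\mathcal{R}_\nu)$, and the
restriction to $\mathcal{N}_\nu$ of $(\lambda I - T)^\nu$ is 0.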


The integer $\nu$ occurring in Theorem 1.8.1 is called the index of $\lambda$
with respect to $T$. If $\lambda$ is not an eigenvalue of $T$, then the operator
$\lambda I - T$ is one to one, and so $\mathcal{N}_1 = \{0\} = \mathcal{N}_0$. Thus $\nu = 0$, $\mathcal{R}_\nu = \mathcal{R}_0 = X$,
and $\lambda I - T$ has an inverse in $\mathcal{B}(X)$. Hence $\lambda \in \rho(T)$. If, on the other
hand, $\lambda$ is an eigenvalue of $T$, then $\mathcal{N}_1 \ne \{0\} = \mathcal{N}_0$, and so $\nu > 0$. In
this case, the dimensions of the subspaces $\mathcal{N}_1$ and $\mathcal{N}_\nu$ are called
the geometric multiplicity and algebraic multiplicity (respectively)
of $\lambda$ as an eigenvalue of $T$. Note that $\mathcal{N}_1$ consists of all
eigenvectors of $T$ associated with the eigenvalue $\lambda$, while $\mathcal{N}_\nu$
consists of all principal vectors of $T$ associated with $\lambda$.
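
To make the index and the two multiplicities concrete, here is a sketch in the finite-dimensional case, where every linear operator is compact. The space $X = \mathbb{C}^2$ and the matrix below are illustrative assumptions, not examples from the text.

```latex
% Illustrative example: a 2x2 Jordan block with non-zero eigenvalue \lambda.
\[
T = \begin{pmatrix} \lambda & 1 \\ 0 & \lambda \end{pmatrix}, \qquad
\lambda I - T = \begin{pmatrix} 0 & -1 \\ 0 & 0 \end{pmatrix}, \qquad
(\lambda I - T)^2 = 0 \qquad (\lambda \neq 0).
\]
% Null spaces and ranges of the powers of \lambda I - T:
\[
\mathcal{N}_0 = \{0\} \subsetneq \mathcal{N}_1 = \operatorname{span}\{e_1\}
  \subsetneq \mathcal{N}_2 = \mathbb{C}^2 = \mathcal{N}_3 = \cdots,
\]
\[
\mathcal{R}_0 = \mathbb{C}^2 \supsetneq \mathcal{R}_1 = \operatorname{span}\{e_1\}
  \supsetneq \mathcal{R}_2 = \{0\} = \mathcal{R}_3 = \cdots.
\]
% Hence the index is \nu = 2, the geometric multiplicity is dim N_1 = 1,
% and the algebraic multiplicity is dim N_2 = 2.
\[
\mathcal{N}_2 \cap \mathcal{R}_2 = \{0\}, \qquad
\mathcal{N}_2 + \mathcal{R}_2 = \mathbb{C}^2 .
\]
```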


THEOREM 1.8.2. Let $T$ be a compact linear operator acting
on a Banach space $X$. If $\lambda \in \sigma(T)$ and $\lambda \ne 0$, then $\lambda$ is an eigenvalue
of $T$. Furthermore, $\sigma(T)$ is either finite or countably infinite; in the
latter case, $\sigma(T) = \{\lambda_n : n = 1, 2, \ldots\}$, where $(\lambda_n)$ is a complex
sequence which converges to 0.


THEOREM 1.8.3. Suppose that $T$ is a compact linear operator
acting on a complex Banach space $X$, and $M$ is a closed invariant
subspace of $T$.

(i) The restriction $T^M$ of $T$ to $M$ is a compact linear operator
on $M$.

(ii) The mapping $T_M : x + M \to Tx + M$ is a compact linear
operator on the quotient space $X/M$.

(iii) If $\lambda \in \rho(T)$ then $M$ is invariant under $(\lambda I - T)^{-1}$.


Proof. Part (i) of the theorem is an immediate consequence of
the definition of 'compact linear operator'. It follows from Theorem
1.8.2 that the resolvent set $\rho(T)$ is connected, so (iii) is a
consequence of Theorem 1.5.3.

For $x$ in $X$, let $[x]$ denote the coset $x + M$. Given a sequence $([x_n])$
in $X/M$ such that $\sup \|[x_n]\| < \infty$, we can choose $y_n$ in $[x_n]$ such
that $\|y_n\| \le 2\|[x_n]\|$. Thus $\sup \|y_n\| < \infty$, and so there is a
subsequence $(y_{n(q)})$ such that $(Ty_{n(q)})$ converges to an element $y$ of
$X$. It follows that

$$\|T_M[x_{n(q)}] - [y]\| = \|T_M[y_{n(q)}] - [y]\| = \|[Ty_{n(q)} - y]\| \le \|Ty_{n(q)} - y\| \to 0$$

as $q \to \infty$. This shows that $T_M$ is a compact linear operator on $X/M$,
and completes the proof of the theorem.


We now specialize to the case of a compact linear operator 
acting on a Hilbert space. The next few theorems can be formulated 
so as to apply to compact operators acting on Banach spaces (see,
for example, [59: pp. 275, 282, 283]), but it is the Hilbert space 
versions that will be needed in later chapters. The more elementary 
results are, again, stated without proof. 

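If $T$ is a compact linear operator acting on a Hilbert space $\mathcal{H}$,
then the adjoint $T^*$ is compact. If $\lambda$ is a non-zero complex number
then, in addition to the subspaces $\mathcal{N}_k$ and $\mathcal{R}_k$ introduced in (1) and
(2), we define

$$\mathcal{N}_k^* = \{x \in \mathcal{H} : (\bar\lambda I - T^*)^k x = 0\}, \tag{3}$$
$$\mathcal{R}_k^* = \{(\bar\lambda I - T^*)^k x : x \in \mathcal{H}\}. \tag{4}$$

Since $\mathcal{R}_k$ and $\mathcal{R}_k^*$ are closed (Theorem 1.8.1), they are the closed
range spaces of the operator $(\lambda I - T)^k$ and its adjoint $(\bar\lambda I - T^*)^k$,
respectively. By 1.7(6),

$$\mathcal{N}_k^* = \mathcal{R}_k^\perp, \qquad \mathcal{N}_k = (\mathcal{R}_k^*)^\perp. \tag{5}$$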


We use the notation just introduced in the following theorem. 


THEOREM 1.8.4. The spaces $\mathcal{N}_k$ and $\mathcal{N}_k^*$ have the same finite
dimension ($k = 0, 1, 2, \ldots$). The index of $\lambda$ with respect to $T$ is the
same as that of $\bar\lambda$ with respect to $T^*$; when it is positive, the
geometric and algebraic multiplicities of $\lambda$ as an eigenvalue of $T$
are the same as those of $\bar\lambda$ as an eigenvalue of $T^*$.

It is implicit in Theorem 1.8.4 that a non-zero complex number $\lambda$
is an eigenvalue of $T$ if and only if $\bar\lambda$ is an eigenvalue of $T^*$; this
can also be deduced from Theorem 1.8.2, together with 1.7(4).


THEOREM 1.8.5. Suppose that $T$ is a compact linear operator
acting on a Hilbert space $\mathcal{H}$, and $\mathcal{P}$ is the closed subspace of $\mathcal{H}$
generated by the set of all principal vectors of $T$ associated with
non-zero eigenvalues. Then $\mathcal{P}^\perp$ is invariant under $T^*$ and the
restriction $N$ of $T^*$ to $\mathcal{P}^\perp$ is quasi-nilpotent.


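Proof. If $\lambda$ is a non-zero eigenvalue of $T$, let

$$\mathcal{N}(\lambda) = \{x \in \mathcal{H} : (\lambda I - T)^n x = 0 \text{ for some positive integer } n\}, \tag{6}$$

the space of all principal vectors of $T$ associated with $\lambda$. Thus $\mathcal{P}$
is the closed subspace of $\mathcal{H}$ generated by

$$\bigcup \{\mathcal{N}(\lambda) : \lambda \in \sigma(T),\ \lambda \ne 0\}.$$

Since each $\mathcal{N}(\lambda)$ is invariant under $T$, so is $\mathcal{P}$. If $y \in \mathcal{P}^\perp$ then, for
each $x$ in $\mathcal{P}$, we have $Tx \in \mathcal{P}$ and so

$$\langle T^*y, x\rangle = \langle y, Tx\rangle = 0.$$

Hence $T^*y \in \mathcal{P}^\perp$ whenever $y \in \mathcal{P}^\perp$; that is, $\mathcal{P}^\perp$ is invariant under $T^*$.

The restriction $N$ of $T^*$ to $\mathcal{P}^\perp$ is a compact linear operator on
$\mathcal{P}^\perp$. Suppose that it is not quasi-nilpotent. Then there is a non-zero
$\mu$ in $\sigma(N)$ and, by Theorem 1.8.2, $\mu$ is an eigenvalue of $N$. Let $x$ be
a non-zero vector in $\mathcal{P}^\perp$ such that $Nx = \mu x$ (that is, $T^*x = \mu x$). Since
$\mu$ is a non-zero eigenvalue of $T^*$, its complex conjugate $\lambda$ is a
non-zero eigenvalue of $T$. Let $\nu$ be the index of $\lambda$ relative to $T$, so
that $\bar\lambda$ has index $\nu$ relative to $T^*$. With the notation introduced in
(1), (2), (3), (4) and (6), we have $\mathcal{N}_\nu = \mathcal{N}(\lambda) \subseteq \mathcal{P}$ and, by (5),

$$\mathcal{P}^\perp \subseteq \mathcal{N}_\nu^\perp = \mathcal{R}_\nu^*.$$

It follows from Theorem 1.8.1 that the restriction of $\bar\lambda I - T^*$ to $\mathcal{R}_\nu^*$
has an inverse in $\mathcal{B}(\mathcal{R}_\nu^*)$, and is therefore one to one. However,

$$x \in \mathcal{P}^\perp \subseteq \mathcal{R}_\nu^*, \qquad x \ne 0, \qquad (\bar\lambda I - T^*)x = \mu x - T^*x = 0.$$

This contradiction arose from the assumption that $N$ is not
quasi-nilpotent; so the theorem is proved.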


Suppose that $T$ is a compact linear operator acting on a Hilbert
space $\mathcal{H}$. By Theorem 1.8.2, the non-zero elements of $\sigma(T)$ can be
arranged as a sequence $(\mu_1, \mu_2, \ldots)$. Suppose that $\mu_j$ has algebraic
multiplicity $m(j)$ as an eigenvalue of $T$, and that $(\lambda_n)$ is the sequence
obtained by taking $m(1)$ terms $\mu_1$, then $m(2)$ terms $\mu_2$, then $m(3)$
terms $\mu_3$, and so on. We shall describe $(\lambda_n)$ as the sequence of
non-zero eigenvalues of $T$, counted according to their algebraic
multiplicities. The above conditions do not determine $(\lambda_n)$ uniquely,
since the non-zero points of $\sigma(T)$ can be arranged in any order as a
sequence $(\mu_j)$; usually, this ambiguity does not matter, but
occasionally we impose additional conditions on $(\mu_j)$ to ensure
uniqueness.

THEOREM 1.8.6. Suppose that $T$ is a compact linear operator
acting on a Hilbert space $\mathcal{H}$, $(\lambda_n)$ is the sequence of non-zero
eigenvalues of $T$ (counted according to their algebraic multiplicities)
and $\mathcal{P}$ is the closed subspace of $\mathcal{H}$ generated by the set of all
principal vectors of $T$ associated with the non-zero eigenvalues.
Then there is an orthonormal basis $\{\varphi_n : n = 1, 2, \ldots\}$ of $\mathcal{P}$ such
that $\langle T\varphi_n, \varphi_n\rangle = \lambda_n$.


Proof. We use the notation introduced in the paragraph
preceding the statement of Theorem 1.8.6, and denote by $\nu(j)$ the
index of $\mu_j$ relative to $T$. Thus

$$\mathcal{N}(j) = \{x \in \mathcal{H} : (\mu_j I - T)^{\nu(j)} x = 0\} \tag{7}$$

is the space of all principal vectors of $T$ associated with the
eigenvalue $\mu_j$; it has dimension $m(j)$, and is invariant under $T$. If
$T_j$ denotes the restriction of $T$ to $\mathcal{N}(j)$, then the operator $\mu_j I - T_j$ on
$\mathcal{N}(j)$ is nilpotent. There is a basis $e(j, 1), e(j, 2), \ldots, e(j, m(j))$ of
$\mathcal{N}(j)$, with respect to which $\mu_j I - T_j$ is represented by a super-diagonal
matrix $A = (a_{k\ell})$ [26: p. 107]. Since $A$ is nilpotent, its diagonal
coefficients (which are its eigenvalues) are all zero. Thus $a_{k\ell} = 0$
($k \ge \ell$), and

$$(\mu_j I - T)e(j, \ell) = \sum_{k=1}^{\ell-1} a_{k\ell}\, e(j, k). \tag{8}$$

Let $(\psi_n)$ be the sequence of vectors obtained by taking first
$e(1, 1), \ldots, e(1, m(1))$, then $e(2, 1), \ldots, e(2, m(2))$, then
$e(3, 1), \ldots, e(3, m(3))$, and so on. Since the sequence $(\lambda_n)$
consists of $\mu_1$ ($m(1)$ times), $\mu_2$ ($m(2)$ times), and so on, it follows
that the integers $n$ for which $\lambda_n = \mu_j$ for some preassigned $j$ are
precisely those for which $\psi_n$ has the form $e(j, \ell)$ ($1 \le \ell \le m(j)$).
This, together with (8), implies that $(\lambda_n I - T)\psi_n$ can be expressed in
the form

$$(\lambda_n I - T)\psi_n = \sum_{k=1}^{n-1} \beta_{nk}\psi_k \qquad (n = 1, 2, \ldots). \tag{9}$$


We shall now show that the sequence $(\psi_n)$ is linearly independent.
Suppose that this is not so. Then there exist an integer $n$ and
scalars $a(j, k)$ ($k = 1, \ldots, m(j)$; $j = 1, \ldots, n$), not all zero, such that

$$\sum_{j=1}^{n} \sum_{k=1}^{m(j)} a(j, k)\, e(j, k) = 0.$$

Thus

$$x_j \in \mathcal{N}(j), \qquad \sum_{j=1}^{n} x_j = 0, \tag{10}$$

where

$$x_j = \sum_{k=1}^{m(j)} a(j, k)\, e(j, k) \qquad (j = 1, \ldots, n).$$

Since $e(j, 1), \ldots, e(j, m(j))$ are linearly independent, and some
$a(j, k)$ is non-zero, the corresponding $x_j$ is non-zero.

Let $S_j = (\mu_j I - T)^{\nu(j)}$. By (7), $S_j x = 0$ whenever $x \in \mathcal{N}(j)$. If $\ell \ne j$
and $x \in \mathcal{N}(\ell)$, then

$$S_j x = (\mu_j I - T)^{\nu(j)} x = (\mu_j I - T_\ell)^{\nu(j)} x.$$

However, since $\mu_\ell I - T_\ell$ is nilpotent, the spectrum of $T_\ell$ consists of
$\mu_\ell$ only, and so $\mu_j I - T_\ell$ is invertible. It follows that

$$S_j x = 0 \quad (x \in \mathcal{N}(j)); \qquad S_j x \in \mathcal{N}(\ell) \sim \{0\} \ \text{ if } x \in \mathcal{N}(\ell) \sim \{0\} \ (\ell \ne j).$$

Hence the operator $R_\ell = S_1 S_2 \cdots S_{\ell-1} S_{\ell+1} \cdots S_n$ annihilates $\mathcal{N}(j)$
when $1 \le j \le n$ and $j \ne \ell$, but is one to one on $\mathcal{N}(\ell)$. From (10),

$$0 = \sum_{j=1}^{n} R_\ell x_j = R_\ell x_\ell,$$

and so $x_\ell = 0$ ($\ell = 1, \ldots, n$). This contradicts our previous
conclusion that at least one $x_j$ is non-zero, and so establishes our
assertion that the sequence $(\psi_n)$ is linearly independent.


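Let $(\varphi_n)$ be the orthonormal sequence obtained from $(\psi_n)$ by the
Schmidt orthogonalisation process. With the notation used in
Theorem 1.6.5,

$$\varphi_n \in [\psi_1, \ldots, \psi_n] = [\varphi_1, \ldots, \varphi_n]. \tag{11}$$

It follows from (9) that $(\lambda_n I - T)\psi_n \in [\psi_1, \ldots, \psi_{n-1}]$; furthermore,
if $m < n$, then

$$(\lambda_n I - T)\psi_m = (\lambda_n - \lambda_m)\psi_m + (\lambda_m I - T)\psi_m \in [\psi_1, \ldots, \psi_m] \subseteq [\psi_1, \ldots, \psi_{n-1}].$$

This, with (11), shows that

$$(\lambda_n I - T)\varphi_n \in [\psi_1, \ldots, \psi_{n-1}] = [\varphi_1, \ldots, \varphi_{n-1}];$$

and so

$$0 = \langle (\lambda_n I - T)\varphi_n, \varphi_n\rangle = \lambda_n - \langle T\varphi_n, \varphi_n\rangle.$$

We have now constructed an orthonormal sequence $(\varphi_n)$ such that
$\langle T\varphi_n, \varphi_n\rangle = \lambda_n$. The closed subspace $M$ generated by the $\varphi_n$'s is
the same as the one generated by the $\psi_n$'s. Since each $\psi_n$ is a
principal vector of $T$ associated with a non-zero eigenvalue, and
each such principal vector is a linear combination of $\psi_n$'s, it
follows that $M = \mathcal{P}$. Hence $(\varphi_n)$ is an orthonormal basis of $\mathcal{P}$, and
the theorem is proved.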
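
The construction in this proof can be followed by hand in two dimensions. The matrix below is a hypothetical example chosen for this sketch; it shows that the orthonormal vectors $\varphi_n$ need not be eigenvectors, although the diagonal entries $\langle T\varphi_n, \varphi_n\rangle$ are the eigenvalues.

```latex
% Illustrative example on C^2 (an assumption of this sketch).
\[
T = \begin{pmatrix} 1 & 1 \\ 0 & 2 \end{pmatrix}, \qquad
\mu_1 = 1, \quad \mu_2 = 2, \quad m(1) = m(2) = 1 .
\]
% Principal vectors (here eigenvectors), taken in the order of the proof:
\[
\psi_1 = e(1,1) = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \qquad
\psi_2 = e(2,1) = \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \qquad
T\psi_1 = \psi_1, \quad T\psi_2 = 2\psi_2 .
\]
% Schmidt orthogonalisation of (\psi_1, \psi_2):
\[
\varphi_1 = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \qquad
\varphi_2 = \begin{pmatrix} 0 \\ 1 \end{pmatrix}, \qquad
T\varphi_2 = \begin{pmatrix} 1 \\ 2 \end{pmatrix} \neq 2\varphi_2 .
\]
% The diagonal entries nevertheless recover the eigenvalues:
\[
\langle T\varphi_1, \varphi_1 \rangle = 1 = \lambda_1, \qquad
\langle T\varphi_2, \varphi_2 \rangle = 2 = \lambda_2 .
\]
```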


THEOREM 1.8.7. Suppose that $\mathcal{H}$ is a Hilbert space and
$T \in \mathcal{B}(\mathcal{H})$. Then the following three conditions are equivalent.

(i) $T$ is compact.

(ii) Given any orthonormal system $\{\psi_k : k \in K\}$ in $\mathcal{H}$,
$\langle T\psi_k, \psi_k\rangle \to 0$ as $k \to \infty$ (in the sense of §1.2).

(iii) There is a sequence $(F_n)$ of operators of finite rank on $\mathcal{H}$,
such that $\|T - F_n\| \to 0$ as $n \to \infty$.


Proof. Suppose that (i) is satisfied and (ii) is not. For some
orthonormal system $\{\psi_k : k \in K\}$ and some positive $\delta$, the set
$\{k \in K : |\langle T\psi_k, \psi_k\rangle| > 2\delta\}$ is infinite; it therefore has a countably
infinite subset which, by change of notation, we can take to be the
set of all positive integers. Thus $\{\psi_n : n = 1, 2, \ldots\}$ is an
orthonormal sequence, and

$$|\langle T\psi_n, \psi_n\rangle| > 2\delta \qquad (n = 1, 2, \ldots). \tag{12}$$

Since $T$ is compact, there is a subsequence $(n(j))$ of the positive
integers such that $(T\psi_{n(j)})$ converges to an element $x$ of $\mathcal{H}$. By
deleting a finite number of terms of this sequence, we may suppose
that

$$\|T\psi_{n(j)} - x\| < \delta \qquad (j = 1, 2, \ldots).$$

Thus

$$|\langle T\psi_{n(j)}, \psi_{n(j)}\rangle - \langle x, \psi_{n(j)}\rangle| = |\langle T\psi_{n(j)} - x, \psi_{n(j)}\rangle| \le \|T\psi_{n(j)} - x\| < \delta,$$

and so, by (12), $|\langle x, \psi_{n(j)}\rangle| > \delta$ ($j = 1, 2, \ldots$). Hence the series
$\sum_j |\langle x, \psi_{n(j)}\rangle|^2$ diverges, contradicting Bessel's inequality. This
shows that (i) implies (ii).

Suppose that (ii) is satisfied. Given a positive integer $n$,
consider the class $\mathcal{S}$ of all orthonormal sets $A$ in $\mathcal{H}$ for which

$$|\langle Ta, a\rangle| \ge \frac{1}{4n} \qquad (a \in A)$$

(we allow the empty set as one possible choice of $A$). By (ii), each
$A$ in $\mathcal{S}$ is a finite set. Since the union of a strictly increasing
sequence of sets in $\mathcal{S}$ is again a member of $\mathcal{S}$ (and therefore finite),
each such sequence terminates. It follows that $\mathcal{S}$ has a maximal
element $B$. If $M$ is the (finite-dimensional) subspace of $\mathcal{H}$ generated
by $B$, then $|\langle Tx, x\rangle| < (4n)^{-1}$ whenever $x \in M^\perp$ and $\|x\| = 1$; for
otherwise $\mathcal{S}$ contains $B \cup \{x\}$, contradicting the maximality of $B$.
Thus $|\langle Tx, x\rangle| < n^{-1}$ whenever $x \in M^\perp$ and $\|x\| \le 2$; it now follows
from equation 1.7(2) that

$$|\langle Tu, v\rangle| < n^{-1} \qquad (u, v \in M^\perp,\ \|u\| \le 1,\ \|v\| \le 1). \tag{13}$$

By taking $u = (I - P)x$ and $v = (I - P)y$, where $P$ is the projection from
$\mathcal{H}$ onto $M$, we deduce from (13) that

$$|\langle (I - P)T(I - P)x, y\rangle| < n^{-1}$$

whenever $x, y \in \mathcal{H}$, $\|x\| \le 1$ and $\|y\| \le 1$. Thus $\|(I - P)T(I - P)\| \le n^{-1}$.
The operator

$$F_n = PT + TP - PTP$$

has finite rank, and $\|T - F_n\| \le n^{-1}$, since $T - F_n = (I - P)T(I - P)$.
This shows that (ii) implies (iii).

Since operators of finite rank are compact, and the set of
compact linear operators is closed in $\mathcal{B}(\mathcal{H})$, it is apparent that (iii)
implies (i). This completes the proof of the theorem.

It is not known whether or not conditions (i) and (iii) in Theorem 
1.8.7 are equivalent for bounded linear operators acting on a general 
Banach space; for most of the particular Banach spaces encountered 


in elementary analysis, an affirmative answer is known. 


THEOREM 1.8.8. Suppose that $T$ is a quasi-nilpotent operator
acting on a Hilbert space $\mathcal{H}$, and the skew-adjoint part $\operatorname{Im} T$ is
compact. Then $T$ is compact.


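Proof. Let $A = \operatorname{Re} T$, $B = \operatorname{Im} T$, so that $A$ and $B$ are self-adjoint,
$T = A + iB$, and $B$ is compact; we have to show that $A$ is compact.
Suppose the contrary. By Theorem 1.8.7, there is an orthonormal
system $\{\psi_k : k \in K\}$ in $\mathcal{H}$ such that $\langle A\psi_k, \psi_k\rangle \not\to 0$ as $k \to \infty$. For
some positive $\delta$, the set

$$K_0 = \{k \in K : |\langle A\psi_k, \psi_k\rangle| > \delta\}$$

is infinite. If $\{\varphi_n : n = 1, 2, \ldots\}$ is a countably infinite subset of
$\{\psi_k : k \in K_0\}$, then $(\varphi_n)$ is an orthonormal sequence and

$$|\langle A\varphi_n, \varphi_n\rangle| > \delta \qquad (n = 1, 2, \ldots). \tag{14}$$

Since

$$|\langle A^m\varphi_n, \varphi_n\rangle|^2 \le \|A^m\varphi_n\|^2 = \langle A^m\varphi_n, A^m\varphi_n\rangle = \langle A^{2m}\varphi_n, \varphi_n\rangle$$

for all positive integers $m$ and $n$, it follows from (14) that

$$|\langle A^m\varphi_n, \varphi_n\rangle| > \delta^m \qquad (n = 1, 2, \ldots) \tag{15}$$

whenever $m = 2^q$ for some $q = 0, 1, 2, \ldots$. For sufficiently large $m$
of the form $2^q$, $\|T^m\|^{1/m} < \delta$, and thus

$$\delta^m > \|T^m\| = \|(A + iB)^m\| = \|A^m - C\|$$

for some compact linear operator $C$. By Theorem 1.8.7,
$\langle C\varphi_n, \varphi_n\rangle \to 0$ as $n \to \infty$. Hence

$$|\langle A^m\varphi_n, \varphi_n\rangle| \le |\langle (A^m - C)\varphi_n, \varphi_n\rangle| + |\langle C\varphi_n, \varphi_n\rangle| \le \|A^m - C\| + |\langle C\varphi_n, \varphi_n\rangle| < \delta^m$$

for sufficiently large $n$. This contradicts (15), and so completes the
proof of the theorem.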


We conclude this section by describing a class of examples of
compact linear operators. Suppose that $(E, \mathfrak{S}, \mu)$ is a $\sigma$-finite
measure space, $(E \times E, \mathfrak{S} \times \mathfrak{S}, \mu \times \mu)$ is the product of this measure
space with itself, and $h \in L_2(E \times E, \mu \times \mu)$; for the terminology used
here, we refer to [25: Chapter 7]. By Fubini's theorem,

$$\|h\|^2 = \int_E d\mu(s) \int_E |h(s, t)|^2\, d\mu(t) \tag{16}$$

(here, and subsequently, the norm of any function refers to the usual
norm in the appropriate $L_2$ space). For almost all $s$ in $E$, $h(s, t)$ is
of class $L_2(E, \mu)$ as a function of $t$; let $Z$ denote the exceptional
set of measure zero. If $f \in L_2(E, \mu)$, it follows from the
Cauchy-Schwarz inequality that the integral

$$(T_h f)(s) = \int_E h(s, t) f(t)\, d\mu(t)$$

exists whenever $s \in E \sim Z$, and

$$|(T_h f)(s)|^2 \le \|f\|^2 \int_E |h(s, t)|^2\, d\mu(t). \tag{17}$$

It is readily verified that the function $T_h f$ (defined arbitrarily on $Z$)
is measurable. From (16) and (17),

$$\int_E |(T_h f)(s)|^2\, d\mu(s) \le \|f\|^2 \int_E d\mu(s) \int_E |h(s, t)|^2\, d\mu(t) = \|f\|^2 \|h\|^2,$$

so $T_h f \in L_2(E, \mu)$ and $\|T_h f\| \le \|h\|\, \|f\|$. It follows that $T_h$ is a
bounded linear operator on the Hilbert space $L_2(E, \mu)$, with
$\|T_h\| \le \|h\|$. We shall refer to $h$ as an $L_2$ kernel, and to $T_h$ as its
associated integral operator. It follows easily from Fubini's theorem
that, if $f, g \in L_2(E, \mu)$, then

$$\langle T_h f, g\rangle = \iint_{E \times E} h(s, t) f(t) \overline{g(s)}\, d\mu(s)\, d\mu(t). \tag{18}$$

From this, it is readily verified that the adjoint $T_h^*$ is the integral
operator associated with the $L_2$ kernel $h^*$, where $h^*(s, t) = \overline{h(t, s)}$.

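We assert that $T_h$ is a compact linear operator. To prove this,
suppose that $\{\psi_k : k \in K\}$ is an orthonormal system in $L_2(E, \mu)$.
The functions $\Psi_k$ defined on $E \times E$ by $\Psi_k(s, t) = \psi_k(s)\overline{\psi_k(t)}$ form
an orthonormal system in $L_2(E \times E, \mu \times \mu)$. With $f = g = \psi_k$, it
follows from (18) that

$$\langle T_h\psi_k, \psi_k\rangle = \langle h, \Psi_k\rangle,$$

and Bessel's inequality asserts that

$$\sum_{k \in K} |\langle T_h\psi_k, \psi_k\rangle|^2 = \sum_{k \in K} |\langle h, \Psi_k\rangle|^2 \le \|h\|^2.$$

Thus $\langle T_h\psi_k, \psi_k\rangle \to 0$ as $k \to \infty$. By Theorem 1.8.7, $T_h$ is a
compact linear operator.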
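
For orientation, the norm bound and the formula for the adjoint kernel can be checked on one familiar kernel. The choice of $E = [0, 1]$ with Lebesgue measure and of the indicator kernel below is an assumption made for this sketch; the text itself treats a general $\sigma$-finite measure space.

```latex
% Illustrative example: E = [0,1] with Lebesgue measure, and the kernel
\[
h(s,t) = \begin{cases} 1 & (0 \le t \le s \le 1), \\ 0 & (0 \le s < t \le 1), \end{cases}
\qquad
(T_h f)(s) = \int_0^s f(t)\, dt .
\]
% Norm of the kernel, computed as in (16):
\[
\|h\|^2 = \int_0^1 ds \int_0^s dt = \int_0^1 s \, ds = \tfrac{1}{2},
\qquad\text{so}\qquad \|T_h\| \le \|h\| = 2^{-1/2} .
\]
% Adjoint kernel h^*(s,t) = \overline{h(t,s)}:
\[
h^*(s,t) = \begin{cases} 1 & (0 \le s \le t \le 1), \\ 0 & (0 \le t < s \le 1), \end{cases}
\qquad
(T_h^* g)(s) = \int_s^1 g(t)\, dt .
\]
```

Since $h$ is square-integrable on the unit square, the argument above shows that this integral operator is compact.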

Finally, we assert that $T_h = 0$ if and only if $h(s, t) = 0$ almost
everywhere on $E \times E$. The 'if' part of this statement is an
immediate consequence of the inequality $\|T_h\| \le \|h\|$. Now
suppose that $T_h = 0$. To prove that $h(s, t) = 0$ almost everywhere
on $E \times E$, it is sufficient to show that $h(s, t) = 0$ almost everywhere
on $E_0 \times E_0$, whenever $E_0$ is a measurable subset of $E$ with
$\mu(E_0) < \infty$. Let $\mathcal{S}$ denote the class of all measurable subsets $S$ of
$E_0 \times E_0$ which satisfy

$$\iint_S h(s, t)\, d\mu(s)\, d\mu(t) = 0. \tag{19}$$

As a function on $E_0 \times E_0$, $h$ is of class $L_2$, and therefore (since
$E_0 \times E_0$ has finite measure) of class $L_1$. If $S_1, S_2, S_3, \ldots \in \mathcal{S}$, and
the sequence $(S_n)$ is either increasing or decreasing, it follows
easily from the dominated convergence theorem that $\lim S_n \in \mathcal{S}$.
Following [25: p. 27] we describe this property by saying that $\mathcal{S}$ is
a monotone class. If $A$ and $B$ are measurable subsets of $E_0$ we
can take $f$ and $g$ in (18) to be the characteristic functions of $B$ and
$A$ respectively, and deduce that $A \times B \in \mathcal{S}$. It follows that $\mathcal{S}$
contains the ring consisting of all finite disjoint unions of such
sets $A \times B$. Since $\mathcal{S}$ is monotone, it contains the $\sigma$-ring generated
by this last ring [25: p. 27]; that is, $\mathcal{S}$ consists of all measurable
subsets $S$ of $E_0 \times E_0$, and (19) is satisfied for all such $S$. By taking
for $S$, in turn, the four sets on which the real and imaginary parts of
$h$ both have constant sign, it follows from (19) that $h(s, t) = 0$
almost everywhere on $E_0 \times E_0$.


1.9. Compact normal operators

We begin this section by describing a simple process for
constructing compact linear operators, in particular, compact normal
operators. Suppose that $\mathcal{H}$ is a Hilbert space, $(\varphi_1, \varphi_2, \ldots)$ and
$(\psi_1, \psi_2, \ldots)$ are orthonormal sequences in $\mathcal{H}$, and $(\lambda_1, \lambda_2, \ldots)$ is a
sequence of non-zero complex numbers satisfying

$$|\lambda_1| \ge |\lambda_2| \ge |\lambda_3| \ge \cdots. \tag{1}$$

We require that the sequences $(\lambda_n)$, $(\varphi_n)$, $(\psi_n)$ all have the same
number of terms (finite or countably infinite); when they are infinite,
we suppose that $(\lambda_n)$ converges to 0.


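For each $x$ in $\mathcal{H}$,

$$\sum_n |\lambda_n \langle x, \varphi_n\rangle|^2 \le |\lambda_1|^2 \sum_n |\langle x, \varphi_n\rangle|^2 \le |\lambda_1|^2 \|x\|^2.$$

Hence the equation

$$Tx = \sum_n \lambda_n \langle x, \varphi_n\rangle \psi_n \tag{2}$$

defines an element $Tx$ of $\mathcal{H}$, and $\|Tx\| \le |\lambda_1|\, \|x\|$. It follows that
$T$ is a bounded linear operator on $\mathcal{H}$, and $\|T\| \le |\lambda_1|$. Furthermore

$$T\varphi_j = \lambda_j \psi_j \qquad (j = 1, 2, \ldots);$$

by taking $j = 1$, we deduce that $\|T\| \ge |\lambda_1|$, so

$$\|T\| = |\lambda_1|. \tag{3}$$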


We assert that $T$ is a compact linear operator. If the sequence
$(\lambda_n)$ is finite, then $T$ has finite rank and the result is apparent. If
$(\lambda_n)$ is an infinite sequence then, for each $k = 1, 2, \ldots$, we can
define an operator $T_k$ of finite rank by

$$T_k x = \sum_{n=1}^{k} \lambda_n \langle x, \varphi_n\rangle \psi_n \qquad (x \in \mathcal{H}).$$

Since

$$(T - T_k)x = \sum_{n=k+1}^{\infty} \lambda_n \langle x, \varphi_n\rangle \psi_n \qquad (x \in \mathcal{H}),$$

it follows from (3), with $T - T_k$ in place of $T$, that

$$\|T - T_k\| = |\lambda_{k+1}| \to 0$$

as $k \to \infty$. By Theorem 1.8.7, $T$ is compact.
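
A concrete instance of this construction may help. The sketch below takes $\mathcal{H} = l_2$ with its standard orthonormal basis $(e_n)$; the particular choices of $\lambda_n$, $\varphi_n$ and $\psi_n$ are assumptions made for illustration, not examples from the text.

```latex
% Illustrative example in l_2 with standard basis (e_n):
%   \lambda_n = 1/n,   \varphi_n = e_n,   \psi_n = e_{n+1}.
\[
Tx = \sum_{n=1}^{\infty} \frac{1}{n} \langle x, e_n \rangle e_{n+1},
\qquad
T e_n = \frac{1}{n}\, e_{n+1} \quad (n = 1, 2, \ldots).
\]
% Norm of T and of the finite-rank truncations, from (3):
\[
\|T\| = |\lambda_1| = 1, \qquad
T_k x = \sum_{n=1}^{k} \frac{1}{n} \langle x, e_n \rangle e_{n+1}, \qquad
\|T - T_k\| = |\lambda_{k+1}| = \frac{1}{k+1} \to 0 .
\]
```

With $\psi_n = e_n$ in place of $e_{n+1}$, the same estimates apply to the diagonal operator $e_n \mapsto n^{-1}e_n$.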

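The conditions imposed in the first paragraph of §1.9 remain
satisfied when $(\varphi_n)$ and $(\psi_n)$ are interchanged and $(\lambda_n)$ is replaced by
the complex conjugate sequence $(\bar\lambda_n)$. It follows that the equation

$$T'y = \sum_n \bar\lambda_n \langle y, \psi_n\rangle \varphi_n \qquad (y \in \mathcal{H}) \tag{4}$$

defines a bounded linear operator $T'$ on $\mathcal{H}$. By (4) and (2),

$$\langle T'y, x\rangle = \sum_n \bar\lambda_n \langle y, \psi_n\rangle \langle \varphi_n, x\rangle,$$

and so

$$\langle x, T^*y\rangle = \langle Tx, y\rangle = \sum_n \lambda_n \langle x, \varphi_n\rangle \langle \psi_n, y\rangle = \overline{\langle T'y, x\rangle} = \langle x, T'y\rangle \qquad (x, y \in \mathcal{H}).$$

Thus $T^* = T'$, and therefore

$$T^*y = \sum_n \bar\lambda_n \langle y, \psi_n\rangle \varphi_n \qquad (y \in \mathcal{H}). \tag{5}$$

Now suppose that $\varphi_j = \psi_j$ ($j = 1, 2, \ldots$). It then follows from (5)
and (2) that

$$\|T^*x\|^2 = \sum_n |\lambda_n \langle x, \varphi_n\rangle|^2 = \|Tx\|^2 \qquad (x \in \mathcal{H});$$

and, by Lemma 1.7.3, $T$ is normal.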

We have now shown that, if the sequences $(\varphi_n)$, $(\psi_n)$ and $(\lambda_n)$
satisfy the conditions set out in the first paragraph of this section,
then equation (2) defines a compact linear operator $T$; and $T$ is
normal if $\varphi_j = \psi_j$ for all $j$. Our main purpose in §1.9 is to show
that every compact normal operator arises in this way. From this,
we shall deduce that each compact linear operator can be
represented in the form (2), by an appropriate choice of $\lambda_j$, $\varphi_j$ and
$\psi_j$. Before proving these results, we require the following lemma.


LEMMA 1.9.1. Let $T$ be a compact normal operator acting on a
Hilbert space $\mathcal{H}$.
(i) $T$ has an eigenvalue $\lambda_1$ satisfying $|\lambda_1| = \|T\|$.
(ii) Every eigenvalue of $T$ has index 1.
(iii) If $\lambda$ is a scalar and $x \in \mathcal{H}$, then $Tx = \lambda x$ if and only if
$T^*x = \bar\lambda x$.
(iv) If $x$ and $y$ are eigenvectors of $T$, associated with distinct
eigenvalues $\lambda$ and $\mu$ respectively, then $\langle x, y\rangle = 0$.


Proof. By Theorem 1.7.4, the spectral radius of $T$ is $\|T\|$, so
there is an element $\lambda_1$ of $\sigma(T)$ satisfying $|\lambda_1| = \|T\|$. If $T \ne 0$,
then $\lambda_1 \ne 0$, and $\lambda_1$ is an eigenvalue of $T$ by Theorem 1.8.2. This
proves (i), since the stated result is obvious when $T = 0$.

By Lemma 1.7.3, $\|Tx\| = \|T^*x\|$ for each $x$ in $\mathcal{H}$. With $T$
replaced by the normal operator $T - \lambda I$, we have

$$\|Tx - \lambda x\| = \|T^*x - \bar\lambda x\|,$$

which proves (iii).

Suppose that $\lambda$ is an eigenvalue of $T$, $x \in \mathcal{H}$, and $(T - \lambda I)^2 x = 0$.
By (iii), with $Tx - \lambda x$ in place of $x$, we have

$$(T^* - \bar\lambda I)(T - \lambda I)x = 0.$$

Thus

$$\|(T - \lambda I)x\|^2 = \langle (T^* - \bar\lambda I)(T - \lambda I)x, x\rangle = 0,$$

and therefore $(T - \lambda I)x = 0$. This proves (ii).

Under the conditions set out in (iv),

$$\lambda\langle y, x\rangle = \langle y, \bar\lambda x\rangle = \langle y, T^*x\rangle = \langle Ty, x\rangle = \langle \mu y, x\rangle = \mu\langle y, x\rangle.$$

Since $\lambda \ne \mu$, it follows that $\langle y, x\rangle = 0$.


Suppose that $T$ is a compact normal operator acting on a Hilbert
space $\mathcal{H}$. By Theorem 1.8.2, the distinct non-zero eigenvalues of $T$
can be arranged as a (finite or infinite) sequence $(\mu_1, \mu_2, \ldots)$; and,
in the infinite case, $(\mu_n)$ converges to 0. We may assume that the
$\mu_n$'s have been ordered so that

$$|\mu_1| \ge |\mu_2| \ge |\mu_3| \ge \cdots. \tag{6}$$

The algebraic multiplicity $m(j)$ of $\mu_j$ as an eigenvalue of $T$ coincides
with the geometric multiplicity, by Lemma 1.9.1(ii). Let $(\lambda_n)$ be the
sequence consisting of $m(1)$ terms $\mu_1$, then $m(2)$ terms $\mu_2$, then $m(3)$
terms $\mu_3$, and so on. If $(\mu_n)$ is a finite sequence, so is $(\lambda_n)$; in the
infinite case, $(\lambda_n)$ converges to 0.

The null space $\mathcal{N}(j)$ of $\mu_j I - T$ has dimension $m(j)$ and so has an
orthonormal basis $e(j, 1), e(j, 2), \ldots, e(j, m(j))$. Let $(\varphi_n)$ be the
sequence of vectors obtained by taking first $e(1, 1), \ldots, e(1, m(1))$,
then $e(2, 1), \ldots, e(2, m(2))$, then $e(3, 1), \ldots, e(3, m(3))$, and so on.
For any given $j$, the integers $n$ such that $\lambda_n = \mu_j$ are precisely those
for which $\varphi_n$ has the form $e(j, \ell)$ ($1 \le \ell \le m(j)$); so

$$T\varphi_n = \lambda_n\varphi_n \qquad (n = 1, 2, \ldots).$$

If $j \ne k$ then $\langle e(j, \ell), e(k, m)\rangle = 0$, by Lemma 1.9.1(iv); since
$e(j, 1), \ldots, e(j, m(j))$ is orthonormal, for each $j$, it follows that $(\varphi_n)$
is an orthonormal sequence. We shall describe $(\lambda_n)$ as the sequence
of non-zero eigenvalues of $T$, arranged in order of decreasing
magnitude and counted according to their multiplicities; $(\varphi_n)$ is the
corresponding orthonormal sequence of eigenvectors. In general,
the sequence $(\lambda_n)$ is not uniquely determined; for distinct
eigenvalues of $T$ may have the same absolute value, and then (6)
does not completely specify the order of the $\mu_j$'s. However, $(\lambda_n)$ is
uniquely determined if all the $\mu_j$'s are positive; we shall see later
that this last condition is equivalent to the statement that $T$ is a
positive operator.

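If we take $\psi_n = \varphi_n$, the conditions set out in the first paragraph
of §1.9 are satisfied. It follows that the equation

$$T_0 x = \sum_n \lambda_n \langle x, \varphi_n\rangle \varphi_n \qquad (x \in \mathcal{H}) \tag{7}$$

defines a compact normal operator $T_0$ on $\mathcal{H}$. We shall show that
$T = T_0$.

Let $M$ be the closed subspace generated by the vectors $\varphi_n$, and
denote by $P$ the projection from $\mathcal{H}$ onto $M$. For each $n$,

$$T_0\varphi_n = \lambda_n\varphi_n = T\varphi_n;$$

by linearity and continuity, $T_0 y = Ty$ whenever $y \in M$. Thus
$T_0 P = TP$. However,

$$T_0 x = \sum_n \lambda_n \langle x, \varphi_n\rangle \varphi_n = \sum_n \lambda_n \langle x, P\varphi_n\rangle \varphi_n = \sum_n \lambda_n \langle Px, \varphi_n\rangle \varphi_n = T_0 Px \qquad (x \in \mathcal{H}),$$

so $T_0 = T_0 P$. Thus

$$TP = T_0. \tag{8}$$

Also, since $PTx \in M$ whenever $x \in \mathcal{H}$, and $(\varphi_n)$ is an orthonormal
basis of $M$,

$$PTx = \sum_n \langle PTx, \varphi_n\rangle \varphi_n = \sum_n \langle x, T^*P\varphi_n\rangle \varphi_n.$$

Since $T^*P\varphi_n = T^*\varphi_n = \bar\lambda_n\varphi_n$ by Lemma 1.9.1(iii), it follows that

$$PTx = \sum_n \lambda_n \langle x, \varphi_n\rangle \varphi_n = T_0 x.$$

This, with (8), shows that $TP = T_0 = PT$. By taking adjoints, we
obtain $PT^* = T^*P$. Since $T$ is normal, it follows that the operators
$P$, $T$, $T^*$ all commute. Hence $T - T_0$ ($= T - TP$) is normal.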

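Suppose that $T \ne T_0$. By Lemma 1.9.1(i), $T - T_0$ has a non-zero
eigenvalue $\mu$. Let $\varphi$ ($\ne 0$) be a corresponding eigenvector. Since
$(T - T_0)\varphi_n = 0$ and $(T - T_0)\varphi = \mu\varphi$, it follows from Lemma 1.9.1(iv) that

$$\langle \varphi, \varphi_n\rangle = 0 \qquad (n = 1, 2, \ldots). \tag{9}$$

By (7), $T_0\varphi = 0$, so $T\varphi = (T - T_0)\varphi = \mu\varphi$. Thus $\mu$ is a non-zero
eigenvalue of $T$, so $\mu = \mu_j$ for some $j$, and $\varphi \in \mathcal{N}(j)$. However, $\mathcal{N}(j)$
has a basis consisting of a finite set of $\varphi_n$'s and, by (9), $\varphi \in \mathcal{N}(j)^\perp$.
Hence $\varphi \in \mathcal{N}(j) \cap \mathcal{N}(j)^\perp = \{0\}$, contradicting our assumption that
$\varphi \ne 0$. It follows that $T = T_0$.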

The results of the preceding paragraphs are summarized in the 


following theorem. 


THEOREM 1.9.2. Suppose that $T$ is a compact normal operator
acting on a Hilbert space $\mathcal{H}$, $(\lambda_n)$ is the sequence of non-zero
eigenvalues (arranged in order of decreasing magnitude and counted
according to their multiplicities) and $(\varphi_n)$ is the corresponding
orthonormal sequence of eigenvectors. Then

$$Tx = \sum_n \lambda_n \langle x, \varphi_n\rangle \varphi_n \qquad (x \in \mathcal{H}).$$

We have shown that every compact normal operator $T$ can be
represented in the form (2), with $\psi_n = \varphi_n$ and $(\lambda_n)$ the sequence of
non-zero eigenvalues of $T$. This applies, in particular, to compact
self-adjoint operators. By comparing (2) and (5), in the case where
$\varphi_n = \psi_n$, it follows that a compact normal operator $T$ is self-adjoint
if and only if each of its eigenvalues is real. By (2), with $\psi_n = \varphi_n$,

$$\langle Tx, x\rangle = \sum_n \lambda_n |\langle x, \varphi_n\rangle|^2 \qquad (x \in \mathcal{H}),$$

so $T$ is positive if and only if the $\lambda_n$'s are all positive.

The functional calculus for a compact self-adjoint operator (see
Theorem 1.7.7 and the remarks following its proof) assumes a very
simple form. We may assume that $T$ is represented as in Theorem
1.9.2, with all the $\lambda_n$'s real. Suppose that $f \in C(\sigma(T))$. If
$\langle z, \varphi_n\rangle = 0$ for each $n$, then $Tz = 0$; since $T\varphi_n = \lambda_n\varphi_n$, it follows from
Theorem 1.7.7(vii) that $f(T)z = f(0)z$ and $f(T)\varphi_n = f(\lambda_n)\varphi_n$. Every
vector $x$ has the form

$$x = \sum_n \langle x, \varphi_n\rangle \varphi_n + z,$$

where $\langle z, \varphi_n\rangle = 0$ for all $n$. Thus

$$f(T)x = \sum_n \langle x, \varphi_n\rangle f(T)\varphi_n + f(T)z = \sum_n f(\lambda_n) \langle x, \varphi_n\rangle \varphi_n + f(0)z.$$

The case in which $f(0) = 0$ is of particular interest. In this case,

$$f(T)x = \sum_n f(\lambda_n) \langle x, \varphi_n\rangle \varphi_n \qquad (x \in \mathcal{H}). \tag{10}$$

If the sequence $(\lambda_n)$ is infinite, it converges to 0; by continuity,
$f(\lambda_n) \to f(0) = 0$. It follows that $f(T)$ is a compact normal operator
if $f(0) = 0$.

We now revert to the study of a general (not necessarily normal)
compact linear operator $T$. By use of Theorem 1.9.2, together with
the polar decomposition of $T$, we shall show that $T$ can be expressed
in the form (2), with sequences $(\lambda_n)$, $(\varphi_n)$, $(\psi_n)$ satisfying the
conditions set out in the first paragraph of §1.9.

Suppose that $\mathcal{H}$ is a Hilbert space and, for each $T$ in $\mathcal{B}(\mathcal{H})$, let
$\mathcal{R}_T$ denote the closed range space of $T$. We recall from Theorem
1.7.12 that there exist a positive linear operator $H$ ($= (T^*T)^{1/2}$),
and a partial isometry $V$ with initial space $\mathcal{R}_{T^*}$ ($= \mathcal{R}_H$) and final
space $\mathcal{R}_T$, such that $T = VH$, $H = V^*T$. From these relations it
follows that $T$ is compact if and only if $H$ is compact; and $T$ has
finite rank if and only if $H$ has (the same) finite rank.


THEOREM 1.9.3. Suppose that $(\mu_n)$ is a decreasing sequence
of positive real numbers which is either finite or infinite and
convergent to 0, and that $(\varphi_n)$ and $(\psi_n)$ are orthonormal sequences
in a Hilbert space $\mathcal{H}$. Then the equation

$$Tx = \sum_n \mu_n \langle x, \varphi_n\rangle \psi_n \qquad (x \in \mathcal{H}) \tag{11}$$

defines a compact linear operator $T$ on $\mathcal{H}$; furthermore, $T$ has finite
rank $k$ if and only if the sequence $(\mu_n)$ terminates after just $k$ terms.
Conversely, if $T$ is a compact linear operator acting on $\mathcal{H}$, then
$T$ can be expressed in the above form; the sequence $(\mu_n)$ is uniquely
determined, and consists of the non-zero eigenvalues of $(T^*T)^{1/2}$, arranged
in order of decreasing magnitude and counted according to their
multiplicities.


Proof. We have already seen, at the beginning of §1.9, that
equation (11) defines a compact linear operator $T$, with adjoint $T^*$
satisfying

$$T^*y = \sum_n \mu_n \langle y, \psi_n\rangle \varphi_n \qquad (y \in \mathcal{H}). \tag{12}$$

Thus $T^*\psi_n = \mu_n\varphi_n$ and, by (11),

$$T^*Tx = \sum_n \mu_n \langle x, \varphi_n\rangle T^*\psi_n = \sum_n \mu_n^2 \langle x, \varphi_n\rangle \varphi_n \qquad (x \in \mathcal{H}).$$

With $f$ defined on the spectrum of the positive operator $T^*T$ by
$f(t) = t^{1/2}$, it follows from (10) (with $T^*T$ and $\mu_n^2$ in place of $T$ and
$\lambda_n$ respectively) that the operator $H = (T^*T)^{1/2}$ satisfies

$$Hx = \sum_n \mu_n \langle x, \varphi_n\rangle \varphi_n \qquad (x \in \mathcal{H}). \tag{13}$$

From (13), and the assumption that $(\mu_n)$ decreases, it is easily
verified that $(\mu_n)$ is the sequence of non-zero eigenvalues of $H$, in
decreasing order and counted according to their multiplicities.
Since $H$ is a positive operator, this implies that $(\mu_n)$ is uniquely
determined when $T$ (hence $H$) is given. It is clear from (11) that
$(\psi_n)$ is an orthonormal basis of $\mathcal{R}_T$; so $T$ has finite rank $k$ if and
only if $(\mu_n)$ terminates after just $k$ terms.


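Conversely, let $T$ be a compact linear operator acting on $\mathcal{H}$, and
let its polar decomposition be $VH$, so that $H$ ($= (T^*T)^{1/2}$) is positive
and compact. Let $(\mu_n)$ be the sequence of non-zero eigenvalues of
$H$ (in decreasing order, and counted according to their multiplicities).
By Theorem 1.9.2, there is an orthonormal sequence $(\varphi_n)$ such that
(13) is satisfied. Thus

$$Tx = VHx = \sum_n \mu_n \langle x, \varphi_n\rangle \psi_n \qquad (x \in \mathcal{H}),$$

where $\psi_n = V\varphi_n$. Now $V$ is a partial isometry with initial space $\mathcal{R}_H$,
and $\varphi_n$ ($= \mu_n^{-1}H\varphi_n$) lies in $\mathcal{R}_H$, so $V^*V\varphi_n = \varphi_n$. Thus

$$\langle \psi_m, \psi_n\rangle = \langle V\varphi_m, V\varphi_n\rangle = \langle V^*V\varphi_m, \varphi_n\rangle = \langle \varphi_m, \varphi_n\rangle.$$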


Hence $(\psi_n)$ is an orthonormal sequence, and the theorem is proved.

If $\mathcal{H}$ is a Hilbert space and $x, y \in \mathcal{H}$, we define a bounded linear
operator $x \otimes y$ on $\mathcal{H}$ by $(x \otimes y)z = \langle z, y\rangle x$ ($z \in \mathcal{H}$). It is clear that
$\|x \otimes y\| = \|x\|\, \|y\|$, and that $x \otimes y$ has rank 1 if $x \ne 0 \ne y$. Now
suppose that $T \in \mathcal{B}(\mathcal{H})$ and $T$ has finite rank $m$. It follows from
Theorem 1.9.3 that $T$ can be expressed in the form

$$T = \sum_{n=1}^{m} \mu_n\, \psi_n \otimes \varphi_n,$$

where $(\mu_n)$ is a sequence of positive real numbers (the non-zero eigenvalues
of $(T^*T)^{1/2}$) and $(\varphi_n)$, $(\psi_n)$ are orthonormal sequences. By (12),

$$T^* = \sum_{n=1}^{m} \mu_n\, \varphi_n \otimes \psi_n,$$

and so $T^*$ has the same finite rank as $T$. If $T$ has polar
decomposition $VH$ then, by (13),

$$H = \sum_{n=1}^{m} \mu_n\, \varphi_n \otimes \varphi_n.$$
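
The following sketch works through these formulas for a single rank-one matrix. The matrix and the space $\mathbb{C}^2$ are assumptions chosen for illustration; the point is that the numbers $\mu_n$ are eigenvalues of $(T^*T)^{1/2}$ and need not be eigenvalues of $T$ itself.

```latex
% Illustrative example on C^2 (an assumption of this sketch).
\[
T = \begin{pmatrix} 1 & 1 \\ 0 & 0 \end{pmatrix}, \qquad
T^*T = \begin{pmatrix} 1 & 1 \\ 1 & 1 \end{pmatrix}, \qquad
H = (T^*T)^{1/2} = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & 1 \\ 1 & 1 \end{pmatrix}.
\]
% H has the single non-zero eigenvalue \mu_1 = \sqrt{2}, with unit eigenvector
% \varphi_1; the vector \psi_1 = V\varphi_1 equals T\varphi_1 / \mu_1.
\[
\mu_1 = \sqrt{2}, \qquad
\varphi_1 = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \qquad
\psi_1 = \frac{1}{\mu_1}\, T\varphi_1 = \begin{pmatrix} 1 \\ 0 \end{pmatrix}.
\]
% The three representations of the text:
\[
T = \sqrt{2}\; \psi_1 \otimes \varphi_1, \qquad
T^* = \sqrt{2}\; \varphi_1 \otimes \psi_1, \qquad
H = \sqrt{2}\; \varphi_1 \otimes \varphi_1 .
\]
% Check of the first: (\psi_1 \otimes \varphi_1) z = <z, \varphi_1> \psi_1, so
\[
\sqrt{2}\, \langle z, \varphi_1 \rangle \psi_1
  = (z_1 + z_2) \begin{pmatrix} 1 \\ 0 \end{pmatrix} = Tz
  \qquad (z = (z_1, z_2)^{\mathsf T} \in \mathbb{C}^2).
\]
```

Note that the eigenvalues of $T$ in this example are 1 and 0, whereas $\mu_1 = \sqrt{2}$.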


CHAPTER 2 
The von Neumann-Schatten Classes of Operators


2.1. Introduction 

This chapter is concerned with certain classes $\mathcal{C}_p$ ($1 \le p < \infty$) of
linear operators on a Hilbert space $\mathcal{H}$ which were introduced by von
Neumann and Schatten [49]. It turns out that each of these classes
is a two-sided ideal in $\mathcal{B}(\mathcal{H})$, and consists of compact operators.
When provided with a suitable norm, $\mathcal{C}_p$ becomes a Banach space
with properties closely analogous to those of the sequence space $l_p$
(see §1.3). The development of this theory involves difficulties,
arising from the non-commutativity of $\mathcal{C}_p$, which have no parallel in
the case of $l_p$.

Throughout Chapter 2, $\mathcal{H}$ denotes a Hilbert space and $J$ is a set
whose cardinality is the dimension of $\mathcal{H}$. Hence each orthonormal
basis of $\mathcal{H}$ can be expressed as $\{\varphi_j : j \in J\}$, while a general (not
necessarily complete) orthonormal system has the form $\{\psi_k : k \in K\}$,
where the cardinality of $K$ is not greater than that of $J$.


DEFINITION 2.1.1. When $1 \le p < \infty$, $\mathcal{C}_p$ is the set of all operators $T$ in $\mathcal{B}(\mathcal{H})$ which satisfy the following condition: for each orthonormal system $\{\psi_k : k \in K\}$ in $\mathcal{H}$,

    $\sum_{k \in K} |\langle T\psi_k, \psi_k\rangle|^p < \infty.$

We shall adopt the convention that $\mathcal{C}_\infty$ is $\mathcal{B}(\mathcal{H})$. It is apparent from the properties of $l_p$ spaces which are described in §1.3 that $\mathcal{C}_p$ is a linear subspace of $\mathcal{B}(\mathcal{H})$, and that $\mathcal{C}_p \subseteq \mathcal{C}_q$ if $1 \le p \le q \le \infty$. Furthermore $T^* \in \mathcal{C}_p$ whenever $T \in \mathcal{C}_p$, and so $\mathcal{C}_p$ contains the self-adjoint and skew-adjoint parts of each of its members. It follows from Theorem 1.8.7 that, if $1 \le p < \infty$, then each element of $\mathcal{C}_p$ is a compact operator.


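LEMMA 2.1.2. Suppose that $1 \le p < \infty$, $T$ is a compact self-adjoint operator on $\mathcal{H}$, and $(\lambda_n)$ is the sequence of non-zero eigenvalues of $T$, counted according to their multiplicities.

(i) If $T \in \mathcal{C}_p$, then $\sum_n |\lambda_n|^p < \infty$.
(ii) If $\sum_n |\lambda_n|^p < \infty$, then $T \in \mathcal{C}_p$ and, for each orthonormal system $\{\psi_k : k \in K\}$ in $\mathcal{H}$,

(1)    $\sum_{k \in K} |\langle T\psi_k, \psi_k\rangle|^p \le \sum_n |\lambda_n|^p.$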


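Proof. There is an orthonormal sequence $(\varphi_n)$ in $\mathcal{H}$ for which

    $Tx = \sum_n \lambda_n \langle x, \varphi_n\rangle \varphi_n \quad (x \in \mathcal{H}).$

If $T \in \mathcal{C}_p$ then, by Definition 2.1.1,

    $\sum_n |\lambda_n|^p = \sum_n |\langle T\varphi_n, \varphi_n\rangle|^p < \infty.$

Suppose conversely that $\sum_n |\lambda_n|^p < \infty$. If $\{\psi_k : k \in K\}$ is any orthonormal system in $\mathcal{H}$, then

    $\langle T\psi_k, \psi_k\rangle = \sum_n \lambda_n |\langle \psi_k, \varphi_n\rangle|^2.$

With $q$ the index conjugate to $p$, Hölder's inequality (with the usual interpretation when $p = 1$ and $q = \infty$) yields

    $|\langle T\psi_k, \psi_k\rangle| \le \sum_n |\lambda_n|\, |\langle \psi_k, \varphi_n\rangle|^{2/p}\, |\langle \psi_k, \varphi_n\rangle|^{2/q}$
    $\le \Big[\sum_n |\lambda_n|^p |\langle \psi_k, \varphi_n\rangle|^2\Big]^{1/p} \Big[\sum_n |\langle \psi_k, \varphi_n\rangle|^2\Big]^{1/q}.$

Since $\sum_n |\langle \psi_k, \varphi_n\rangle|^2 \le \|\psi_k\|^2 = 1$, we have

    $\sum_k |\langle T\psi_k, \psi_k\rangle|^p \le \sum_k \sum_n |\lambda_n|^p |\langle \psi_k, \varphi_n\rangle|^2$
    $= \sum_n |\lambda_n|^p \sum_k |\langle \varphi_n, \psi_k\rangle|^2$
    $\le \sum_n |\lambda_n|^p \|\varphi_n\|^2 = \sum_n |\lambda_n|^p.$

Thus $T \in \mathcal{C}_p$ and (1) is satisfied.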


We have seen in Theorem 1.8.7 that an operator $T$ on $\mathcal{H}$ is compact if and only if it is the limit in norm of operators of finite rank. It follows at once that $T$ is compact if and only if there is a sequence $(F_n)$ of operators on $\mathcal{H}$, for which $F_n$ has finite rank not more than $n$ and $\lim_{n \to \infty} \|T - F_n\| = 0$. Our next result shows that membership of $\mathcal{C}_p$ is equivalent to a qualitative statement about the rapidity with which $\|T - F_n\|$ can converge to 0.


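THEOREM 2.1.3. Suppose $T \in \mathcal{B}(\mathcal{H})$ and $1 \le p < \infty$. Then $T \in \mathcal{C}_p$ if and only if there is a sequence $(F_n)$ of operators on $\mathcal{H}$, such that $F_n$ has finite rank not greater than $n$ and

    $\sum_{n=1}^{\infty} \|T - F_n\|^p < \infty.$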


Proof. Let $\mathcal{D}$ denote the class of all operators $T$ on $\mathcal{H}$ for which such a sequence $(F_n)$ exists. Given $S$ and $T$ in $\mathcal{D}$ and scalars $\alpha$ and $\beta$, choose operators $F_n$ and $G_n$ with finite rank not more than $n$ for which

(2)    $\sum_{n=1}^{\infty} \|S - F_n\|^p < \infty, \qquad \sum_{n=1}^{\infty} \|T - G_n\|^p < \infty,$

and define a sequence $(H_n)$ by

    $H_1 = 0, \qquad H_{2n} = H_{2n+1} = \alpha F_n + \beta G_n \quad (n = 1, 2, \ldots).$

Then $H_n$ has finite rank not greater than $n$, and

    $\sum_{n=2}^{\infty} \|\alpha S + \beta T - H_n\|^p = 2 \sum_{n=1}^{\infty} \|\alpha(S - F_n) + \beta(T - G_n)\|^p$
    $\le 2 \sum_{n=1}^{\infty} \big[\,|\alpha|\,\|S - F_n\| + |\beta|\,\|T - G_n\|\,\big]^p.$

The last quantity is finite, by (2) and Minkowski's inequality, so $\alpha S + \beta T \in \mathcal{D}$. Furthermore $S^* \in \mathcal{D}$, since $F_n^*$ has finite rank not more than $n$ and

    $\sum_{n=1}^{\infty} \|S^* - F_n^*\|^p = \sum_{n=1}^{\infty} \|S - F_n\|^p < \infty.$

Finally, $S$ is compact since it is the limit in norm of the sequence $(F_n)$ of operators of finite rank.


It follows from the results just proved that $\mathcal{D}$ is a linear subspace of $\mathcal{B}(\mathcal{H})$ which consists of compact operators and contains the self-adjoint and skew-adjoint parts of each of its members. We have already seen that $\mathcal{C}_p$ has these properties. Hence each of the subspaces $\mathcal{D}$ and $\mathcal{C}_p$ is the linear span of its self-adjoint elements; and in order to prove that $\mathcal{D} = \mathcal{C}_p$ it is sufficient to show that a compact self-adjoint operator $T$ lies in $\mathcal{D}$ if and only if it lies in $\mathcal{C}_p$. Given such a $T$, we may suppose that

    $Tx = \sum_{n=1}^{\infty} \lambda_n \langle x, \varphi_n\rangle \varphi_n \quad (x \in \mathcal{H}),$

where $(\varphi_n)$ is an orthonormal sequence and $(\lambda_n)$ is the sequence of non-zero eigenvalues of $T$ (arranged in order of decreasing magnitude, counted according to their algebraic multiplicities, and followed by an infinite sequence of zeros if $T$ is of finite rank).

If $T \in \mathcal{C}_p$, define an operator $F_n$ by

    $F_n x = \sum_{m=1}^{n} \lambda_m \langle x, \varphi_m\rangle \varphi_m \quad (x \in \mathcal{H}).$

Then $F_n$ has finite rank not greater than $n$,

    $(T - F_n)x = \sum_{m=n+1}^{\infty} \lambda_m \langle x, \varphi_m\rangle \varphi_m \quad (x \in \mathcal{H}),$
    $\|T - F_n\| = \sup\{|\lambda_m| : m > n\} = |\lambda_{n+1}|.$

By Lemma 2.1.2,

    $\sum_{n=1}^{\infty} \|T - F_n\|^p = \sum_{n=1}^{\infty} |\lambda_{n+1}|^p < \infty,$

so $T \in \mathcal{D}$.
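Suppose conversely that $T \in \mathcal{D}$. Let $G_n$ be an operator of finite rank not more than $n$ ($n = 1, 2, \ldots$) for which $\sum_n \|T - G_n\|^p < \infty$. For a fixed $n$, we may choose vectors $x_m$, $y_m$ ($1 \le m \le n$) in $\mathcal{H}$ such that

    $G_n x = \sum_{m=1}^{n} \langle x, x_m\rangle y_m \quad (x \in \mathcal{H}).$

Then $G_n(a_1\varphi_1 + \ldots + a_{n+1}\varphi_{n+1}) = 0$ provided that the scalars $a_1, \ldots, a_{n+1}$ satisfy the $n$ linear equations

    $\langle a_1\varphi_1 + \ldots + a_{n+1}\varphi_{n+1},\, x_m\rangle = 0 \quad (m = 1, \ldots, n).$

With $a_1, \ldots, a_{n+1}$ (not all zero) satisfying these equations, we have

    $\|(T - G_n)(a_1\varphi_1 + \ldots + a_{n+1}\varphi_{n+1})\| = \|T(a_1\varphi_1 + \ldots + a_{n+1}\varphi_{n+1})\|$
    $= \|\lambda_1 a_1\varphi_1 + \ldots + \lambda_{n+1}a_{n+1}\varphi_{n+1}\|$
    $= \big[\,|\lambda_1 a_1|^2 + \ldots + |\lambda_{n+1}a_{n+1}|^2\,\big]^{1/2}$
    $\ge |\lambda_{n+1}|\,\big[\,|a_1|^2 + \ldots + |a_{n+1}|^2\,\big]^{1/2}$
    $= |\lambda_{n+1}|\,\|a_1\varphi_1 + \ldots + a_{n+1}\varphi_{n+1}\|.$

Hence $|\lambda_{n+1}| \le \|T - G_n\|$,

    $\sum_{n=1}^{\infty} |\lambda_n|^p \le |\lambda_1|^p + \sum_{n=1}^{\infty} \|T - G_n\|^p < \infty,$

and by Lemma 2.1.2, $T \in \mathcal{C}_p$.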
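The first half of the proof can be illustrated numerically. The following is a minimal sketch, not part of the original argument: it assumes a positive diagonal operator with decreasing eigenvalues $\lambda_n = 1/n$, truncated to finitely many terms, and the helper names `truncation_errors` and `p_sum` are hypothetical. For such an operator the truncations $F_n$ give $\|T - F_n\| = \lambda_{n+1}$, so the partial sums of $\sum \|T - F_n\|^p$ stay bounded for $p = 2$ but keep growing for $p = 1$.

```python
def truncation_errors(eigenvalues):
    """Norms ||T - F_n|| for n = 1, 2, ..., where T is diagonal with the given
    decreasing positive eigenvalues and F_n keeps the first n of them.
    For a decreasing sequence the error is the first discarded eigenvalue."""
    count = len(eigenvalues)
    return [eigenvalues[n] if n < count else 0.0 for n in range(1, count + 1)]


def p_sum(values, p):
    """Finite sum of p-th powers: a partial sum of the series in Theorem 2.1.3."""
    return sum(v ** p for v in values)


# Assumed example: eigenvalues 1/n for n = 1..N (square-summable, not summable).
N = 2000
eigs = [1.0 / n for n in range(1, N + 1)]
errs = truncation_errors(eigs)

sum_p2 = p_sum(errs, 2)   # remains below pi^2/6 - 1 however large N is
sum_p1 = p_sum(errs, 1)   # grows like log N
```

In this assumed example the operator behaves like an element of $\mathcal{C}_2$ that is not in $\mathcal{C}_1$, which is what the summability criterion predicts.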


COROLLARY 2.1.4. $\mathcal{C}_p$ contains each operator of finite rank on $\mathcal{H}$ ($1 \le p \le \infty$).

Proof. This is obvious when $p = \infty$, and follows at once from Theorem 2.1.3 when $1 \le p < \infty$.


LEMMA 2.1.5. Suppose $1 \le p \le \infty$, $T \in \mathcal{C}_p$ and $A \in \mathcal{B}(\mathcal{H})$. Then $AT \in \mathcal{C}_p$, $TA \in \mathcal{C}_p$.

Proof. We may suppose $1 \le p < \infty$, since the result is evident when $p = \infty$. By Theorem 2.1.3 there is a sequence $(F_n)$ of operators on $\mathcal{H}$ for which $F_n$ has finite rank not greater than $n$ and $\sum_n \|T - F_n\|^p < \infty$. Since $AF_n$ and $F_nA$ have finite rank not more than $n$ and

    $\sum_n \|AT - AF_n\|^p \le \|A\|^p \sum_n \|T - F_n\|^p < \infty,$
    $\sum_n \|TA - F_nA\|^p \le \|A\|^p \sum_n \|T - F_n\|^p < \infty,$

it follows, again from Theorem 2.1.3, that $AT \in \mathcal{C}_p$ and $TA \in \mathcal{C}_p$.


Some of the results proved above are of purely technical interest, but will be needed in §2.3 when $\mathcal{C}_p$ is studied as a Banach space. We conclude this section by summarising, as a theorem, the main algebraic properties of $\mathcal{C}_p$ which have been established so far.

THEOREM 2.1.6. Suppose $1 \le p \le \infty$. Then $\mathcal{C}_p$ is a two sided ideal in $\mathcal{B}(\mathcal{H})$. It contains each operator of finite rank on $\mathcal{H}$, and also contains the adjoint of each of its members. If $1 \le p < \infty$, then each element of $\mathcal{C}_p$ is a compact operator.


2.2. The trace on $\mathcal{C}_1$

If $\mathcal{H}$ is an $n$-dimensional Hilbert space and $\varphi_1, \ldots, \varphi_n$ is an orthonormal basis in $\mathcal{H}$, we can define a linear functional $\tau$ on $\mathcal{B}(\mathcal{H})$ by

    $\tau(A) = \langle A\varphi_1, \varphi_1\rangle + \ldots + \langle A\varphi_n, \varphi_n\rangle \quad (A \in \mathcal{B}(\mathcal{H})).$

Elementary algebraic manipulations show that $\tau$ does not depend on the choice of the orthonormal basis $\varphi_1, \ldots, \varphi_n$ and that, if $A$ has matrix $[a_{jk}]$ with respect to some orthonormal basis, then $\tau(A) = a_{11} + \ldots + a_{nn}$, the trace of the matrix $[a_{jk}]$. For this reason we refer to $\tau$ as the trace on $\mathcal{B}(\mathcal{H})$. It is easily verified that $\tau(AB) = \tau(BA)$ for all $A$ and $B$ in $\mathcal{B}(\mathcal{H})$, and that $\tau(A^*A) > 0$ if $A \ne 0$.

It is not difficult to show that, if $\mathcal{H}$ is an infinite-dimensional Hilbert space, then there is no linear functional $\tau$ on $\mathcal{B}(\mathcal{H})$ such that $\tau(AB) = \tau(BA)$ for all $A$ and $B$ in $\mathcal{B}(\mathcal{H})$ and $\tau(A^*A) > 0$ if $A \ne 0$. There is, however, a linear functional $\tau$ on the ideal $\mathcal{C}_1$ of $\mathcal{B}(\mathcal{H})$ which satisfies these conditions whenever $A \in \mathcal{C}_1$ and $B \in \mathcal{B}(\mathcal{H})$. We shall now construct this functional and investigate its properties.
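The finite-dimensional statements can be checked directly. The sketch below is illustrative only: it assumes $\mathcal{H} = \mathbb{C}^2$, two arbitrarily chosen matrices, and hypothetical helper names (`matmul`, `inner`, `apply`, `trace_in_basis`). It evaluates $\sum_j \langle A\varphi_j, \varphi_j\rangle$ in two different orthonormal bases and compares $\tau(AB)$ with $\tau(BA)$.

```python
import math


def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def inner(x, y):
    """Inner product <x, y>, linear in x and conjugate linear in y."""
    return sum(xi * complex(yi).conjugate() for xi, yi in zip(x, y))


def apply(a, x):
    """Matrix-vector product A x."""
    return [sum(aij * xj for aij, xj in zip(row, x)) for row in a]


def trace_in_basis(a, basis):
    """Sum of <A phi_j, phi_j> over an orthonormal basis (phi_j)."""
    return sum(inner(apply(a, phi), phi) for phi in basis)


# Assumed example data.
A = [[1, 2j], [3, 4]]
B = [[0, 1], [5, -2j]]

standard = [[1, 0], [0, 1]]
c, s = math.cos(0.7), math.sin(0.7)
rotated = [[c, s], [-s, c]]          # another orthonormal basis of C^2

tau_standard = trace_in_basis(A, standard)   # a_11 + a_22
tau_rotated = trace_in_basis(A, rotated)     # same value in the rotated basis
tau_AB = trace_in_basis(matmul(A, B), standard)
tau_BA = trace_in_basis(matmul(B, A), standard)
```

Here `tau_standard` and `tau_rotated` agree up to rounding, as do `tau_AB` and `tau_BA`, in line with the basis independence and the identity $\tau(AB) = \tau(BA)$ stated above.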


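LEMMA 2.2.1. Let $T \in \mathcal{C}_1$ and suppose that $\{\varphi_j : j \in J\}$ is an orthonormal basis in $\mathcal{H}$.

(i) The sum $\sum_{j \in J} \langle T\varphi_j, \varphi_j\rangle$ exists, and does not depend on the particular choice of the orthonormal basis $\{\varphi_j : j \in J\}$.

(ii) If $T = T^*$, then $\sum_{j \in J} \langle T\varphi_j, \varphi_j\rangle = \sum_n \lambda_n$, where $(\lambda_n)$ is the sequence of non-zero eigenvalues of $T$, counted according to their multiplicities.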


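Proof. Since $\mathcal{C}_1$ contains the self-adjoint and skew-adjoint parts of each of its members, (i) is an immediate consequence of (ii). Under the hypotheses of (ii) we may choose an orthonormal sequence $(\psi_n)$ such that

    $Tx = \sum_n \lambda_n \langle x, \psi_n\rangle \psi_n \quad (x \in \mathcal{H}).$

Then

    $\langle T\varphi_j, \varphi_j\rangle = \sum_n \lambda_n |\langle \varphi_j, \psi_n\rangle|^2,$

(1)    $\sum_j |\langle T\varphi_j, \varphi_j\rangle| \le \sum_j \sum_n |\lambda_n|\, |\langle \varphi_j, \psi_n\rangle|^2$
       $= \sum_n |\lambda_n| \sum_j |\langle \psi_n, \varphi_j\rangle|^2$
       $= \sum_n |\lambda_n|\, \|\psi_n\|^2$
       $= \sum_n |\lambda_n|,$

and the last quantity is finite, by Lemma 2.1.2. It follows that the sum $\sum_{j \in J} \langle T\varphi_j, \varphi_j\rangle$ exists. A manipulation similar to the one just given, exploiting the finiteness of the double sum on the right-hand side of (1) to justify change in the order of summation, shows that

    $\sum_{j \in J} \langle T\varphi_j, \varphi_j\rangle = \sum_n \lambda_n.$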

We shall eventually show that, in part (ii) of Lemma 2.2.1, the 
condition T = T* is redundant (cp. Theorem 3.3.13). 


DEFINITION 2.2.2. The ideal $\mathcal{C}_1$ in $\mathcal{B}(\mathcal{H})$ is called the trace class of operators on $\mathcal{H}$. If $T \in \mathcal{C}_1$ and $\{\varphi_j : j \in J\}$ is an orthonormal basis in $\mathcal{H}$, then the trace of $T$, denoted by $\tau(T)$, is defined by the equation

    $\tau(T) = \sum_{j \in J} \langle T\varphi_j, \varphi_j\rangle.$

Lemma 2.2.1 shows that $\tau(T)$ depends only on $T$ (not on the choice of the orthonormal basis), and that $\tau(T)$ is the sum of the eigenvalues of $T$ when $T = T^*$. Before establishing the main algebraic properties of $\tau$, we require an auxiliary result.


LEMMA 2.2.3. Each element $A$ of $\mathcal{B}(\mathcal{H})$ is a finite linear combination of unitary operators.

Proof. Since each element of $\mathcal{B}(\mathcal{H})$ can be expressed in terms of its self-adjoint and skew-adjoint parts, it is sufficient to consider the case in which $A = A^*$ and $\|A\| \le 1$. In this case the spectrum $\sigma(A)$ of $A$ is contained in the compact interval $[-1, 1]$, and we can define a continuous function $f$ on $\sigma(A)$ by $f(t) = t + i\sqrt{1 - t^2}$. Let $U = f(A)$, the operator corresponding to $f$ in the functional calculus described in Theorem 1.7.7. Since

    $t = \tfrac{1}{2}\big[f(t) + \overline{f(t)}\big], \qquad f(t)\overline{f(t)} = \overline{f(t)}f(t) = 1,$

it follows from Theorem 1.7.7 that $A = \tfrac{1}{2}(U + U^*)$ and $UU^* = U^*U = I$.
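The two identities for $f$ used in this proof are easy to confirm numerically. The sketch below is an illustration under a simplifying assumption: $A$ is taken to be a diagonal self-adjoint operator, so that the functional calculus acts entrywise on its eigenvalues. The sample eigenvalues and the variable names are hypothetical.

```python
import math


def f(t):
    """f(t) = t + i*sqrt(1 - t^2), defined for -1 <= t <= 1."""
    return complex(t, math.sqrt(1.0 - t * t))


# Assumed example: real eigenvalues of a diagonal self-adjoint A with ||A|| <= 1.
eigenvalues_A = [-1.0, -0.3, 0.0, 0.5, 1.0]
eigenvalues_U = [f(t) for t in eigenvalues_A]

# U = f(A) is unitary exactly when each eigenvalue of U has modulus 1.
moduli = [abs(u) for u in eigenvalues_U]

# A = (U + U*)/2 corresponds to t = (f(t) + conj(f(t)))/2 on each eigenvalue.
recovered = [((u + u.conjugate()) / 2).real for u in eigenvalues_U]
```

Each entry of `moduli` is 1 up to rounding and `recovered` reproduces `eigenvalues_A`, which is the diagonal case of $UU^* = I$ and $A = \tfrac{1}{2}(U + U^*)$.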


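THEOREM 2.2.4. Suppose that $S, T \in \mathcal{C}_1$, $A \in \mathcal{B}(\mathcal{H})$ and $\alpha$, $\beta$ are scalars.
(i) $\tau(\alpha S + \beta T) = \alpha\tau(S) + \beta\tau(T)$.
(ii) $\tau(S^*) = \overline{\tau(S)}$.
(iii) $\tau(S) > 0$ if $S > 0$.
(iv) $\tau(AS) = \tau(SA)$.

Proof. Let $\{\varphi_j : j \in J\}$ be an orthonormal basis of $\mathcal{H}$. The first two parts of the theorem are obvious consequences of Definition 2.2.2. Suppose now that $S \ge 0$. Then $\tau(S) = \sum_j \langle S\varphi_j, \varphi_j\rangle \ge 0$, since each term in the summation is non-negative. If $\tau(S) = 0$ then, for each $j$, $0 = \langle S\varphi_j, \varphi_j\rangle = \|S^{1/2}\varphi_j\|^2$, so $S^{1/2} = 0$ and $S = 0$. This proves (iii).

For (iv), it is sufficient, by Lemma 2.2.3 and the linearity of $\tau$, to consider the case in which $A$ is a unitary operator on $\mathcal{H}$. In this case, $\{A\varphi_j : j \in J\}$ is an orthonormal basis of $\mathcal{H}$, and

    $\tau(SA) = \sum_j \langle SA\varphi_j, \varphi_j\rangle$
    $= \sum_j \langle SA\varphi_j, A^*A\varphi_j\rangle$
    $= \sum_j \langle AS(A\varphi_j), A\varphi_j\rangle = \tau(AS).$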


2.3. The Banach space $\mathcal{C}_p$

In this section we introduce a norm on $\mathcal{C}_p$ and prove that $\mathcal{C}_p$ is a Banach space with properties similar to those of the sequence space $l_p$. The main results are contained in Theorems 2.3.8, 2.3.10 and 2.3.12.

Suppose that $T$ is a compact linear operator acting on $\mathcal{H}$, and denote by $V_T H_T$ the polar decomposition of $T$. Then $T = V_T H_T$ and $H_T = V_T^* T = (T^*T)^{1/2}$. It follows from Theorem 1.9.3, together with its proof, that there exist a decreasing sequence $(\mu_n)$ of positive real numbers (the eigenvalues of $H_T$, counted according to their multiplicities), and orthonormal sequences $(\varphi_n)$, $(\psi_n)$ such that

(1)    $H_T x = \sum_n \mu_n \langle x, \varphi_n\rangle \varphi_n,$
(2)    $Tx = \sum_n \mu_n \langle x, \varphi_n\rangle \psi_n \quad (x \in \mathcal{H}).$

The sequences $(\mu_n)$, $(\varphi_n)$, $(\psi_n)$ all terminate after just $k$ terms if $T$ has finite rank $k$, and are infinite if $T$ is not of finite rank. Given $p$ ($> 1$) the function $f$ defined by $f(t) = t^p$ is continuous on the non-negative real axis, and hence also on the spectrum of the positive operator $H_T$. The operator $f(H_T)$ will be denoted by $H_T^p$. It follows from equation 1.9(10), and the discussion which follows it, that $H_T^p$ is compact and that

(3)    $H_T^p x = \sum_n \mu_n^p \langle x, \varphi_n\rangle \varphi_n \quad (x \in \mathcal{H}).$

It is easily verified that positive powers of $H_T$ satisfy the usual indicial laws.


LEMMA 2.3.1. Suppose $1 \le q \le p < \infty$ and $T \in \mathcal{B}(\mathcal{H})$. Then the following three conditions are equivalent.

    (i) $T \in \mathcal{C}_p$;    (ii) $H_T \in \mathcal{C}_p$;    (iii) $H_T^{p/q} \in \mathcal{C}_q$.

Proof. Since $\mathcal{C}_p$ is an ideal in $\mathcal{B}(\mathcal{H})$, the equivalence of (i) and (ii) follows at once from the equations $T = V_T H_T$, $H_T = V_T^* T$.

If either (ii) or (iii) is satisfied, then $H_T$ ($= (H_T^{p/q})^{q/p}$) is compact by Theorem 2.1.6, and we may suppose that $H_T$ is represented as in equation (1). By Lemma 2.1.2, $H_T \in \mathcal{C}_p$ if and only if $\sum_n \mu_n^p < \infty$; this is equivalent to $\sum_n (\mu_n^{p/q})^q < \infty$, which (again by Lemma 2.1.2) is a necessary and sufficient condition that $H_T^{p/q} \in \mathcal{C}_q$. Hence (ii) and (iii) are equivalent.

From Lemma 2.1.2 and the equivalence of conditions (i) and (ii) in the lemma just proved, it follows that a compact operator $T$ on $\mathcal{H}$ lies in $\mathcal{C}_p$ if and only if the sequence $(\mu_n)$ of non-zero eigenvalues of $H_T = (T^*T)^{1/2}$ satisfies $\sum_n \mu_n^p < \infty$. This is often used as the definition of $\mathcal{C}_p$ (see, for example, [16]).


DEFINITION 2.3.2. Suppose that $1 \le p < \infty$ and $T \in \mathcal{C}_p$. Then

    $\|T\|_p = \big[\tau(H_T^p)\big]^{1/p}.$

We shall adopt the convention that $\|\cdot\|_\infty$ is the usual norm on $\mathcal{B}(\mathcal{H})$ ($= \mathcal{C}_\infty$). The above definition of $\|T\|_p$ with $1 \le p < \infty$ is meaningful since, by Lemma 2.3.1, $H_T^p \in \mathcal{C}_1$ whenever $T \in \mathcal{C}_p$. It is not immediately obvious that $\|\cdot\|_p$ is a norm on $\mathcal{C}_p$, and before proving this we shall need some auxiliary results.

Note first that, if $1 \le p < \infty$, $T \in \mathcal{C}_p$ and equation (1) holds, then $H_T^p$ is represented in the form (3), and the remarks following Definition 2.2.2 imply that $\tau(H_T^p) = \sum_n \mu_n^p$. Hence

(4)    $\|T\|_p = \Big[\sum_n \mu_n^p\Big]^{1/p},$

where $(\mu_n)$ is the sequence of non-zero eigenvalues of $H_T$, counted according to their multiplicities. Since $\|T\| = \|H_T\| = \mu_1$,

(5)    $\|T\| \le \|T\|_p \quad (T \in \mathcal{C}_p).$
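Formula (4) can be evaluated by hand in the smallest non-trivial case. The following sketch is an illustration only: it assumes a real $2 \times 2$ matrix, for which the eigenvalues of $(T^*T)^{1/2}$ have a closed form, and the function names `singular_values_2x2` and `schatten_norm` are hypothetical.

```python
import math


def singular_values_2x2(m):
    """Eigenvalues mu_1 >= mu_2 of (T*T)^(1/2) for a real 2x2 matrix
    [[a, b], [c, d]], using mu_1^2 + mu_2^2 = trace(T*T) and
    mu_1 * mu_2 = |det T|."""
    (a, b), (c, d) = m
    e = a * a + b * b + c * c + d * d
    det = a * d - b * c
    disc = math.sqrt(max(e * e - 4.0 * det * det, 0.0))
    return math.sqrt((e + disc) / 2.0), math.sqrt(max((e - disc) / 2.0, 0.0))


def schatten_norm(m, p):
    """||T||_p = (sum mu_n^p)^(1/p) for 1 <= p < infinity, as in (4);
    for p = math.inf the usual operator norm mu_1 is returned."""
    mus = singular_values_2x2(m)
    if p == math.inf:
        return max(mus)
    return sum(mu ** p for mu in mus) ** (1.0 / p)


# Assumed example: a diagonal matrix with eigenvalues 3 and 4.
T = [[3.0, 0.0], [0.0, 4.0]]
norm_1 = schatten_norm(T, 1)           # 3 + 4
norm_2 = schatten_norm(T, 2)           # sqrt(9 + 16)
norm_op = schatten_norm(T, math.inf)   # largest eigenvalue of (T*T)^(1/2)
```

For this matrix the three values are 7, 5 and 4, so inequality (5) holds and the norms decrease as $p$ increases.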


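If $T$ is a compact normal operator, and $(\lambda_n)$ is the sequence of non-zero eigenvalues of $T$ (arranged in order of decreasing magnitude and counted according to their multiplicities) then there is an orthonormal sequence $(\varphi_n)$ such that

    $Tx = \sum_n \lambda_n \langle x, \varphi_n\rangle \varphi_n \quad (x \in \mathcal{H}).$

Since $T\varphi_n = \lambda_n\varphi_n$, we have $T^*\varphi_n = \overline{\lambda_n}\varphi_n$ and thus

    $T^*Tx = \sum_n |\lambda_n|^2 \langle x, \varphi_n\rangle \varphi_n,$
    $H_T x = \sum_n |\lambda_n| \langle x, \varphi_n\rangle \varphi_n \quad (x \in \mathcal{H}).$

Thus the sequence of non-zero eigenvalues of $H_T$ is $(|\lambda_n|)$, and so $T \in \mathcal{C}_p$ if and only if $\sum_n |\lambda_n|^p < \infty$. When this is so,

    $\|T\|_p = \Big[\sum_n |\lambda_n|^p\Big]^{1/p}.$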
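LEMMA 2.3.3. For each $T$ in $\mathcal{C}_1$, $|\tau(T)| \le \|T\|_1$.

Proof. We may suppose that $H_T$ and $T$ are represented as in equations (1) and (2). Then $\mu_n > 0$ and $\sum_n \mu_n = \tau(H_T) = \|T\|_1$. Let $\{\theta_j : j \in J\}$ be an orthonormal basis of $\mathcal{H}$ which contains each $\varphi_n$. If $j \in J$ and $\theta_j$ is not one of the $\varphi_n$'s, then $\langle \theta_j, \varphi_n\rangle = 0$ for each $n$ and, by (2), $T\theta_j = 0$. Thus

    $|\tau(T)| = \Big|\sum_j \langle T\theta_j, \theta_j\rangle\Big|$
    $\le \sum_j |\langle T\theta_j, \theta_j\rangle|$
    $= \sum_n |\langle T\varphi_n, \varphi_n\rangle|$
    $= \sum_n |\mu_n \langle \psi_n, \varphi_n\rangle|$
    $\le \sum_n \mu_n$
    $= \|T\|_1.$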


LEMMA 2.3.4. Suppose $1 \le p < \infty$ and $T \in \mathcal{B}(\mathcal{H})$. Then $T \in \mathcal{C}_p$ if and only if there is a constant $M$ such that

(6)    $\Big[\sum_{k=1}^{n} |\langle T\varphi_k, \psi_k\rangle|^p\Big]^{1/p} \le M$

for every pair $(\varphi_1, \ldots, \varphi_n)$, $(\psi_1, \ldots, \psi_n)$ of finite orthonormal systems in $\mathcal{H}$. When this condition is satisfied the least such constant $M$ is $\|T\|_p$.

Proof. Suppose that there is a constant $M$ with the stated properties. Then $\sum_{k \in K} |\langle T\varphi_k, \varphi_k\rangle|^p \le M^p < \infty$ for each orthonormal system $\{\varphi_k : k \in K\}$ in $\mathcal{H}$, because condition (6) shows that an arbitrary finite subsum is not greater than $M^p$. It follows from Definition 2.1.1 that $T \in \mathcal{C}_p$.

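Suppose conversely that $T \in \mathcal{C}_p$ and that $T$ has polar decomposition $V_T H_T$. Let $(\varphi_1, \ldots, \varphi_n)$ and $(\psi_1, \ldots, \psi_n)$ be orthonormal systems in $\mathcal{H}$. Our first step is to show that, for the purpose of the present argument, the partial isometry $V_T$ can be replaced by a unitary operator. For this, note that the vectors $H_T\varphi_1, \ldots, H_T\varphi_n$ span a finite-dimensional subspace $\mathcal{L}$ of the closed range space of $H_T$, and that $V_T$ maps $\mathcal{L}$ isometrically onto a subspace $\mathcal{M}$ of the closed range space of $T$. With $x_1, \ldots, x_k$ an orthonormal basis of $\mathcal{L}$ and $y_j = V_T x_j$, $y_1, \ldots, y_k$ is an orthonormal basis of $\mathcal{M}$. We may extend to form two orthonormal bases $x_1, \ldots, x_k, \ldots, x_m$ and $y_1, \ldots, y_k, \ldots, y_m$ of the subspace $\mathcal{N} = \mathcal{L} + \mathcal{M}$. Each element of $\mathcal{H}$ can be uniquely expressed in the form $a_1x_1 + \ldots + a_mx_m + z$, with $z$ in $\mathcal{N}^\perp$, and the equation

    $U(a_1x_1 + \ldots + a_mx_m + z) = a_1y_1 + \ldots + a_my_m + z$

defines a unitary operator $U$ on $\mathcal{H}$. Since $Ux_j = y_j = V_Tx_j$ ($j = 1, \ldots, k$), $U$ and $V_T$ coincide in their action on $\mathcal{L}$, and $UH_T\varphi_j = V_TH_T\varphi_j = T\varphi_j$ ($j = 1, \ldots, n$). Thus

    $\sum_j |\langle T\varphi_j, \psi_j\rangle|^p = \sum_j |\langle UH_T\varphi_j, \psi_j\rangle|^p$
    $= \sum_j |\langle H_T^{1/2}\varphi_j, H_T^{1/2}U^*\psi_j\rangle|^p$
    $\le \sum_j \|H_T^{1/2}\varphi_j\|^p\, \|H_T^{1/2}U^*\psi_j\|^p$
    $\le \Big[\sum_j \|H_T^{1/2}\varphi_j\|^{2p}\Big]^{1/2} \Big[\sum_j \|H_T^{1/2}U^*\psi_j\|^{2p}\Big]^{1/2}$
    $= \Big[\sum_j \langle H_T\varphi_j, \varphi_j\rangle^p\Big]^{1/2} \Big[\sum_j \langle H_TU^*\psi_j, U^*\psi_j\rangle^p\Big]^{1/2}.$

Since $U^*\psi_1, \ldots, U^*\psi_n$ is an orthonormal system in $\mathcal{H}$, it follows, from Lemma 2.1.2(ii) and the expression (4) for $\|T\|_p$ in terms of the eigenvalues of $H_T$, that the final quantity in the above series of inequalities does not exceed $\|T\|_p^p$. Hence condition (6) is satisfied, with $M = \|T\|_p$.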

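Finally we show that, if $M$ is any constant which satisfies (6), then $M \ge \|T\|_p$. We may suppose that $T$ is represented as in equation (2). Then $(\varphi_n)$ and $(\psi_n)$ are orthonormal sequences and $T\varphi_n = \mu_n\psi_n$. For each $m$ we have

    $M \ge \Big[\sum_{n=1}^{m} |\langle T\varphi_n, \psi_n\rangle|^p\Big]^{1/p} = \Big[\sum_{n=1}^{m} \mu_n^p\Big]^{1/p}.$

Hence $M \ge \big[\sum_n \mu_n^p\big]^{1/p} = \|T\|_p$.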


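COROLLARY 2.3.5. For each $T$ in $\mathcal{C}_p$, $\|T^*\|_p = \|T\|_p$.

COROLLARY 2.3.6. Suppose that $1 \le p < \infty$, $T \in \mathcal{C}_p$ and $(\lambda_n)$ is the sequence of non-zero eigenvalues of $T$, counted according to their algebraic multiplicities. Then

    $\Big[\sum_n |\lambda_n|^p\Big]^{1/p} \le \|T\|_p.$

Proof. By Theorem 1.8.6 there is an orthonormal sequence $(\varphi_n)$ in $\mathcal{H}$ for which $\langle T\varphi_n, \varphi_n\rangle = \lambda_n$, and

    $\Big[\sum_n |\langle T\varphi_n, \varphi_n\rangle|^p\Big]^{1/p} \le \|T\|_p,$

since, by Lemma 2.3.4, each finite subsum of $\sum_n |\langle T\varphi_n, \varphi_n\rangle|^p$ is not greater than $\|T\|_p^p$.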


Let $\mathcal{F}$ be the set of all operators of finite rank on $\mathcal{H}$. When $x, y \in \mathcal{H}$, we define $x \otimes y$ in $\mathcal{F}$ (as at the end of §1.9) by $(x \otimes y)z = \langle z, y\rangle x$ ($z \in \mathcal{H}$). If $\{\varphi_j : j \in J\}$ is an orthonormal basis of $\mathcal{H}$,

    $\tau(x \otimes y) = \sum_j \langle (x \otimes y)\varphi_j, \varphi_j\rangle$
    $= \sum_j \langle \varphi_j, y\rangle \langle x, \varphi_j\rangle$
    $= \langle x, y\rangle,$

by Parseval's equation. We claim also that

    $\|x \otimes y\|_p = \|x\|\,\|y\| \quad (1 \le p \le \infty).$

For this, it is sufficient to consider the case in which $\|x\| = \|y\| = 1$. In that case, the discussion at the end of §1.9 shows that, if $x \otimes y$ has polar decomposition $VH$, then $H = y \otimes y$. Hence the only non-zero eigenvalue of $H$ is 1, and its multiplicity is 1. By (4), $\|x \otimes y\|_p = 1$ ($1 \le p < \infty$).

We recall from §1.3 that, if $1 \le p \le \infty$, then the index $q$ conjugate to $p$ is defined by $p^{-1} + q^{-1} = 1$, with the obvious interpretation when $p$ is 1 or $\infty$.


LEMMA 2.3.7. Suppose that $1 \le p \le \infty$, $q$ is the index conjugate to $p$, and $T \in \mathcal{B}(\mathcal{H})$. Then $T \in \mathcal{C}_p$ if and only if

(7)    $\sup\{|\tau(FT)| : F \in \mathcal{F},\ \|F\|_q \le 1\} < \infty.$

When this is so, the value of the supremum is $\|T\|_p$.


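Proof. Suppose first that the supremum in (7) is a finite quantity $M$. We shall prove that $T \in \mathcal{C}_p$ and $\|T\|_p \le M$. In the case $p = \infty$, it is given that $T \in \mathcal{B}(\mathcal{H}) = \mathcal{C}_\infty$ and, if $x$ and $y$ are unit vectors in $\mathcal{H}$, then $x \otimes y \in \mathcal{F}$, $\|x \otimes y\|_1 = 1$, hence

    $M \ge |\tau((x \otimes y)T)| = |\tau(x \otimes T^*y)|$
    $= |\langle x, T^*y\rangle|$
    $= |\langle Tx, y\rangle|.$

By taking the supremum of the right-hand side over all unit vectors $x$ and $y$, we obtain $M \ge \|T\| = \|T\|_\infty$. We now turn to the case in which $1 \le p < \infty$. Let $(\varphi_1, \ldots, \varphi_n)$ and $(\psi_1, \ldots, \psi_n)$ be orthonormal systems in $\mathcal{H}$ and let $\alpha_k = \langle T\varphi_k, \psi_k\rangle$. We shall prove that $\sum_k |\alpha_k|^p \le M^p$, whereupon Lemma 2.3.4 shows that $T \in \mathcal{C}_p$ and $\|T\|_p \le M$, as required. By deleting any $\varphi_k$ and $\psi_k$ for which $\langle T\varphi_k, \psi_k\rangle = 0$, we may suppose that each $\alpha_k$ is non-zero and define $\beta_k = \overline{\alpha_k}\,|\alpha_k|^{-1}$. Let

    $Bx = \sum_{k=1}^{n} |\alpha_k|^{p-1}\beta_k \langle x, \psi_k\rangle \varphi_k,$
    $Hx = \sum_{k=1}^{n} |\alpha_k|^{p-1} \langle x, \psi_k\rangle \psi_k \quad (x \in \mathcal{H}).$

Then $B \in \mathcal{F}$, and it follows from the discussion at the end of §1.9 that $B$ has polar decomposition of the form $VH$. Since $(p-1)q = p$ if $1 < p < \infty$ and the eigenvalues of $H$ are $|\alpha_1|^{p-1}, \ldots, |\alpha_n|^{p-1}$, it follows from equation (4) that

(8)    $\|B\|_q = \Big[\sum_{k=1}^{n} |\alpha_k|^p\Big]^{1/q}$ if $1 < p < \infty$,
       $\|B\|_q = 1$ if $p = 1$.

For each $x$ in $\mathcal{H}$,

    $BTx = \sum_k |\alpha_k|^{p-1}\beta_k \langle Tx, \psi_k\rangle \varphi_k$
    $= \sum_k |\alpha_k|^{p-1}\beta_k \langle x, T^*\psi_k\rangle \varphi_k,$

so $BT = \sum_k |\alpha_k|^{p-1}\beta_k (\varphi_k \otimes T^*\psi_k)$ and

    $\tau(BT) = \sum_k |\alpha_k|^{p-1}\beta_k \langle \varphi_k, T^*\psi_k\rangle$
    $= \sum_k |\alpha_k|^{p-1}\beta_k\alpha_k$
    $= \sum_k |\alpha_k|^p.$

With $F = \|B\|_q^{-1}B$, we have $F \in \mathcal{F}$ and $\|F\|_q = 1$. Since $M$ is the supremum in (7), it follows from (8) that

    $M \ge |\tau(FT)| = \Big[\sum_{k=1}^{n} |\alpha_k|^p\Big]^{1/p}.$

This completes the proof that, if the supremum in (7) is a finite quantity $M$, then $T \in \mathcal{C}_p$ and $\|T\|_p \le M$.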
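Suppose conversely that $T \in \mathcal{C}_p$. We shall prove that the supremum in (7) is finite and not greater than $\|T\|_p$. For this purpose, suppose that $F \in \mathcal{F}$ and $\|F\|_q \le 1$. If $VH$ is the polar decomposition of $F$, there exist positive $\alpha_1, \ldots, \alpha_n$ and orthonormal systems $(\varphi_1, \ldots, \varphi_n)$ and $(\psi_1, \ldots, \psi_n)$ such that

    $Hx = \sum_{k=1}^{n} \alpha_k \langle x, \varphi_k\rangle \varphi_k,$
    $Fx = \sum_{k=1}^{n} \alpha_k \langle x, \varphi_k\rangle \psi_k \quad (x \in \mathcal{H}).$

Since $\|F\|_q \le 1$, it follows that

(9)    $\Big[\sum_{k=1}^{n} \alpha_k^q\Big]^{1/q} \le 1$ if $1 \le q < \infty$,
       $\alpha_k \le 1$ ($k = 1, \ldots, n$) if $q = \infty$.

For each $x$ in $\mathcal{H}$,

    $FTx = \sum_k \alpha_k \langle Tx, \varphi_k\rangle \psi_k$
    $= \sum_k \alpha_k \langle x, T^*\varphi_k\rangle \psi_k,$

so $FT = \sum_k \alpha_k (\psi_k \otimes T^*\varphi_k)$ and

    $\tau(FT) = \sum_k \alpha_k \langle \psi_k, T^*\varphi_k\rangle = \sum_k \alpha_k\beta_k,$

where $\beta_k = \langle T\psi_k, \varphi_k\rangle$. By Lemma 2.3.4,

(10)   $\Big[\sum_{k=1}^{n} |\beta_k|^p\Big]^{1/p} \le \|T\|_p$ if $1 \le p < \infty$,
       $|\beta_k| \le \|T\|_\infty$ ($k = 1, \ldots, n$) if $p = \infty$.

From (9) and (10) it follows, by use of Hölder's inequality, that

    $|\tau(FT)| \le \sum_{k=1}^{n} \alpha_k|\beta_k| \le \|T\|_p.$

We deduce that, if $T \in \mathcal{C}_p$, then the supremum in (7) is finite and not greater than $\|T\|_p$. This, together with our previous result in the opposite direction, completes the proof of the lemma.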


THEOREM 2.3.8. Suppose that $1 \le p < \infty$. Then $\|\cdot\|_p$ is a norm on $\mathcal{C}_p$, and with this norm $\mathcal{C}_p$ is a Banach space. The set $\mathcal{F}$ of all operators of finite rank on $\mathcal{H}$ is an everywhere dense linear subspace of $\mathcal{C}_p$.

Proof. Since $\tau$ is a linear functional on $\mathcal{C}_1$ and, by Lemma 2.3.7,

    $\|T\|_p = \sup\{|\tau(FT)| : F \in \mathcal{F},\ \|F\|_q \le 1\}$

for each $T$ in $\mathcal{C}_p$, it follows that

    $\|T\|_p \ge 0, \qquad \|\alpha T\|_p = |\alpha|\,\|T\|_p, \qquad \|S + T\|_p \le \|S\|_p + \|T\|_p$

for each $S$ and $T$ in $\mathcal{C}_p$ and each complex number $\alpha$. If $\|T\|_p = 0$ then by (5) $\|T\| = 0$, and so $T = 0$. This shows that $\|\cdot\|_p$ is a norm on $\mathcal{C}_p$.
p 

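Suppose that $(T_n)$ is a Cauchy sequence in $\mathcal{C}_p$. Given a positive $\varepsilon$ there is an integer $N(\varepsilon)$ such that

(11)   $\|T_m - T_n\|_p < \varepsilon \quad (m, n \ge N(\varepsilon)).$

Hence $\|T_m - T_n\| < \varepsilon$ whenever $m, n \ge N(\varepsilon)$, so $(T_n)$ is a Cauchy sequence in $\mathcal{B}(\mathcal{H})$ and converges, with respect to the norm on $\mathcal{B}(\mathcal{H})$, to some $T$ in $\mathcal{B}(\mathcal{H})$. If $x, y \in \mathcal{H}$ then

    $\tau((x \otimes y)T_n) = \tau(x \otimes T_n^*y)$
    $= \langle T_nx, y\rangle$
    $\to \langle Tx, y\rangle = \tau((x \otimes y)T)$

as $n \to \infty$. It follows, from the linearity of $\tau$, that $\tau(FT_n) \to \tau(FT)$ as $n \to \infty$, for each $F$ in $\mathcal{F}$. With $F$ in $\mathcal{F}$ and $\|F\|_q \le 1$, Lemma 2.3.7 and (11) imply that

    $|\tau(FT_m - FT_n)| < \varepsilon \quad (m, n \ge N(\varepsilon)),$

and by taking the limit of the left hand side as $n \to \infty$ we obtain

    $|\tau(FT_m - FT)| \le \varepsilon \quad (m \ge N(\varepsilon)).$

Hence

    $\sup\{|\tau(FT_m - FT)| : F \in \mathcal{F},\ \|F\|_q \le 1\} \le \varepsilon$

when $m \ge N(\varepsilon)$, and Lemma 2.3.7 shows that

    $T_m - T \in \mathcal{C}_p, \qquad \|T_m - T\|_p \le \varepsilon \quad (m \ge N(\varepsilon)).$

It follows that $T \in \mathcal{C}_p$ and that $T_n$ converges to $T$ with respect to the norm on $\mathcal{C}_p$. This proves that $\mathcal{C}_p$ is complete.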
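We show finally that each $T$ in $\mathcal{C}_p$ lies in the closure of $\mathcal{F}$. We may suppose that $T$ is represented as in (2), the summation being over all positive integers. For $m = 1, 2, \ldots$, define $T_m$ in $\mathcal{F}$ by

(12)   $T_m x = \sum_{n=1}^{m} \mu_n \langle x, \varphi_n\rangle \psi_n \quad (x \in \mathcal{H}).$

Then

    $(T - T_m)x = \sum_{n=m+1}^{\infty} \mu_n \langle x, \varphi_n\rangle \psi_n \quad (x \in \mathcal{H}).$

By applying (4), with $T - T_m$ in place of $T$, we obtain

    $\|T - T_m\|_p = \Big[\sum_{n=m+1}^{\infty} \mu_n^p\Big]^{1/p} \to 0$

as $m \to \infty$. Hence $T$ lies in the closure of $\mathcal{F}$.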


Our next objective is to prove a generalization of Hölder's inequality (Theorem 2.3.10). The following lemma is subsumed in Theorem 2.3.10 but is needed in its proof.

LEMMA 2.3.9. Suppose that $2 \le r, s \le \infty$, $t^{-1} = r^{-1} + s^{-1}$, $R \in \mathcal{C}_r$ and $S \in \mathcal{C}_s$. Then $RS \in \mathcal{C}_t$ and $\|RS\|_t \le \|R\|_r \|S\|_s$.

Proof. If $r = s = \infty$ then $t = \infty$ and the result is trivial, so we shall assume that at least one of $r$ and $s$ is finite and hence that $t$ is finite. Let $a = rt^{-1}$, $b = st^{-1}$, and note that $a$ and $b$ are conjugate indices. We give the proof for the case in which $1 < a < \infty$, and leave the reader to supply the simple modifications required when $a = 1$ or $\infty$. Given any finite orthonormal systems $(\varphi_1, \ldots, \varphi_n)$ and $(\psi_1, \ldots, \psi_n)$ in $\mathcal{H}$, we have

(13)   $\Big[\sum_{j=1}^{n} |\langle RS\varphi_j, \psi_j\rangle|^t\Big]^{1/t} = \Big[\sum_{j=1}^{n} |\langle S\varphi_j, R^*\psi_j\rangle|^t\Big]^{1/t}$
       $\le \Big[\sum_{j=1}^{n} \|S\varphi_j\|^t\, \|R^*\psi_j\|^t\Big]^{1/t}$
       $\le \Big[\sum_{j=1}^{n} \|S\varphi_j\|^s\Big]^{1/s} \Big[\sum_{j=1}^{n} \|R^*\psi_j\|^r\Big]^{1/r}$
       $= \Big[\sum_{j=1}^{n} \langle S^*S\varphi_j, \varphi_j\rangle^{s/2}\Big]^{1/s} \Big[\sum_{j=1}^{n} \langle RR^*\psi_j, \psi_j\rangle^{r/2}\Big]^{1/r}.$

Now suppose that $(\mu_n)$ is the sequence of non-zero eigenvalues of $(S^*S)^{1/2}$ counted according to their multiplicities, and note that $(\mu_n^2)$ is the corresponding sequence for $S^*S$. Since $s/2 \ge 1$ and, by (4), $\sum_n \mu_n^s = \|S\|_s^s$, it follows from Lemma 2.1.2 that

    $\sum_{j=1}^{n} \langle S^*S\varphi_j, \varphi_j\rangle^{s/2} \le \|S\|_s^s,$

and a similar argument shows that

    $\sum_{j=1}^{n} \langle RR^*\psi_j, \psi_j\rangle^{r/2} \le \|R^*\|_r^r = \|R\|_r^r.$

These inequalities, together with (13), yield

    $\Big[\sum_{j=1}^{n} |\langle RS\varphi_j, \psi_j\rangle|^t\Big]^{1/t} \le \|R\|_r \|S\|_s.$

Since this has been proved for any finite orthonormal systems $(\varphi_1, \ldots, \varphi_n)$ and $(\psi_1, \ldots, \psi_n)$ in $\mathcal{H}$, it follows from Lemma 2.3.4 that $RS \in \mathcal{C}_t$ and $\|RS\|_t \le \|R\|_r \|S\|_s$.


THEOREM 2.3.10. Suppose that $1 \le r, s, t \le \infty$, $t^{-1} = r^{-1} + s^{-1}$, $R \in \mathcal{C}_r$ and $S \in \mathcal{C}_s$. Then $RS \in \mathcal{C}_t$ and $\|RS\|_t \le \|R\|_r \|S\|_s$.

Proof. We deal separately with three cases which together cover all possibilities.

Case 1. $r, s \ge 2$. In this case the result has already been proved in Lemma 2.3.9.

Case 2. $1 \le s < 2$. Suppose first that $S \in \mathcal{F}$. Let $p$ and $q$ be the indices conjugate to $s$ and $t$, respectively; note that $q^{-1} + r^{-1} + s^{-1} = 1$, whence $q, r \ge 2$ and $q^{-1} + r^{-1} = p^{-1}$. Suppose that $F \in \mathcal{F}$ and $\|F\|_q \le 1$. By using Theorem 2.2.4 (iv) (with $FR$ in place of $A$), Lemma 2.3.7 (with $s$ in place of $q$) and Lemma 2.3.9 (with $q$, $r$ and $p$ in place of $r$, $s$ and $t$, respectively) we obtain

    $|\tau(FRS)| = |\tau(SFR)| \le \|S\|_s \|FR\|_p$
    $\le \|S\|_s \|F\|_q \|R\|_r \le \|R\|_r \|S\|_s.$

Since this inequality is satisfied whenever $F \in \mathcal{F}$ and $\|F\|_q \le 1$, it follows from Lemma 2.3.7 (with $t$ in place of $p$) that $RS \in \mathcal{C}_t$ and $\|RS\|_t \le \|R\|_r \|S\|_s$.

We have proved the theorem in case 2, subject to the additional assumption that $S \in \mathcal{F}$. We continue considering case 2, but now with a general $S$ in $\mathcal{C}_s$. By Theorem 2.3.8 there is a sequence $(S_n)$ in $\mathcal{F}$ for which $\|S - S_n\|_s \to 0$. The result just proved shows that $RS_n \in \mathcal{C}_t$, $\|RS_n\|_t \le \|R\|_r \|S_n\|_s$, and

    $\|RS_m - RS_n\|_t \le \|R\|_r \|S_m - S_n\|_s \to 0$

as $\min(m, n) \to \infty$. Since $\mathcal{C}_t$ is complete, the sequence $(RS_n)$ converges with respect to $\|\cdot\|_t$ to some $T$ in $\mathcal{C}_t$. We have

    $\|RS_n - T\| \le \|RS_n - T\|_t \to 0,$
    $\|RS_n - RS\| \le \|R\|\,\|S_n - S\| \le \|R\|\,\|S_n - S\|_s \to 0,$

so $RS = T \in \mathcal{C}_t$ and

    $\|RS\|_t = \|T\|_t = \lim \|RS_n\|_t$
    $\le \|R\|_r \lim \|S_n\|_s = \|R\|_r \|S\|_s.$

This completes the proof of the theorem in case 2.

Case 3. $1 \le r < 2$. Since $S^* \in \mathcal{C}_s$ and $R^* \in \mathcal{C}_r$ we may apply the known result in case 2 (with $r$ and $s$ reversed) to obtain $S^*R^* \in \mathcal{C}_t$ and $\|S^*R^*\|_t \le \|S^*\|_s \|R^*\|_r$. Hence $RS \in \mathcal{C}_t$ and, by Corollary 2.3.5, $\|RS\|_t \le \|R\|_r \|S\|_s$.

This completes the proof of Theorem 2.3.10.
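The inequality of Theorem 2.3.10 can be spot-checked on small matrices. The sketch below is an illustration, not a proof: it assumes real $2 \times 2$ matrices (so that the eigenvalues of $(T^*T)^{1/2}$ have a closed form), two arbitrarily chosen non-commuting matrices, and hypothetical helper names. The index $t$ is computed from $t^{-1} = r^{-1} + s^{-1}$, with `math.inf` standing for $\infty$.

```python
import math


def matmul(a, b):
    """Product of two 2x2 matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def singular_values_2x2(m):
    """Eigenvalues mu_1 >= mu_2 of (T*T)^(1/2) for a real 2x2 matrix."""
    (a, b), (c, d) = m
    e = a * a + b * b + c * c + d * d
    det = a * d - b * c
    disc = math.sqrt(max(e * e - 4.0 * det * det, 0.0))
    return math.sqrt((e + disc) / 2.0), math.sqrt(max((e - disc) / 2.0, 0.0))


def schatten_norm(m, p):
    """(sum mu_n^p)^(1/p) for finite p; the operator norm for p = math.inf."""
    mus = singular_values_2x2(m)
    if p == math.inf:
        return max(mus)
    return sum(mu ** p for mu in mus) ** (1.0 / p)


def combined_index(r, s):
    """Index t with 1/t = 1/r + 1/s (r or s may be math.inf)."""
    inv = 1.0 / r + 1.0 / s
    return math.inf if inv == 0.0 else 1.0 / inv


# Assumed example: two non-commuting matrices.
R = [[1.0, 2.0], [0.0, 1.0]]
S = [[2.0, -1.0], [1.0, 3.0]]
RS = matmul(R, S)

checks = []
for r, s in [(2.0, 2.0), (4.0, 4.0 / 3.0), (2.0, math.inf), (1.0, math.inf)]:
    t = combined_index(r, s)
    lhs = schatten_norm(RS, t)                       # ||RS||_t
    rhs = schatten_norm(R, r) * schatten_norm(S, s)  # ||R||_r ||S||_s
    checks.append((r, s, t, lhs, rhs))
```

Each tuple in `checks` holds one choice of $(r, s, t)$ together with both sides of the inequality; the pairs were picked so that all three cases of the proof ($r, s \ge 2$; $s < 2$; $r < 2$) occur.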


COROLLARY 2.3.11. Suppose that $A, B \in \mathcal{B}(\mathcal{H})$ and $S \in \mathcal{C}_p$. Then $\|ASB\|_p \le \|A\|\,\|S\|_p\,\|B\|$.

Proof. By Theorem 2.3.10,

    $\|ASB\|_p \le \|A\|_\infty \|SB\|_p$
    $\le \|A\|_\infty \|S\|_p \|B\|_\infty = \|A\|\,\|S\|_p\,\|B\|.$

We now identify the dual space of the Banach space $\mathcal{C}_p$, where $1 \le p < \infty$.

THEOREM 2.3.12. Suppose that $1 \le p < \infty$ and $q$ is the index conjugate to $p$. Then for each $T$ in $\mathcal{C}_q$ the equation $f_T(S) = \tau(ST)$ ($S \in \mathcal{C}_p$) defines a continuous linear functional $f_T$ on $\mathcal{C}_p$. Furthermore the mapping $T \to f_T$ is an isometric isomorphism from $\mathcal{C}_q$ onto the dual space $(\mathcal{C}_p)^*$.


Proof. Suppose that $S \in \mathcal{C}_p$ and $T \in \mathcal{C}_q$. By Theorem 2.3.10 and Lemma 2.3.3 we have $ST \in \mathcal{C}_1$ and

    $|\tau(ST)| \le \|ST\|_1 \le \|S\|_p \|T\|_q.$

It follows that, for each $T$ in $\mathcal{C}_q$, the equation $f_T(S) = \tau(ST)$ ($S \in \mathcal{C}_p$) defines a continuous linear functional $f_T$ on $\mathcal{C}_p$, with $\|f_T\| \le \|T\|_q$. By Lemma 2.3.7 we have

    $\|f_T\| \ge \sup\{|f_T(F)| : F \in \mathcal{F},\ \|F\|_p \le 1\}$
    $= \sup\{|\tau(FT)| : F \in \mathcal{F},\ \|F\|_p \le 1\}$
    $= \|T\|_q,$

so $\|f_T\| = \|T\|_q$. Since the mapping $T \to f_T$ from $\mathcal{C}_q$ into $(\mathcal{C}_p)^*$ is linear, it remains only to prove that its range is the whole of $(\mathcal{C}_p)^*$.

Let $f$ be a continuous linear functional on $\mathcal{C}_p$ and (using the notation introduced at the end of §1.9) define $L(x, y) = f(x \otimes y)$ for each $x$ and $y$ in $\mathcal{H}$. Since $L(x, y)$ is linear in $x$, conjugate linear in $y$ and satisfies

    $|L(x, y)| = |f(x \otimes y)| \le \|f\|\,\|x \otimes y\|_p = \|f\|\,\|x\|\,\|y\|,$

it follows from Theorem 1.7.1 that there is an element $T$ of $\mathcal{B}(\mathcal{H})$ such that $L(x, y) = \langle Tx, y\rangle$. Hence

    $f(x \otimes y) = \langle Tx, y\rangle = \langle x, T^*y\rangle$
    $= \tau(x \otimes T^*y) = \tau((x \otimes y)T),$

and so $f(F) = \tau(FT)$ for each $F$ in $\mathcal{F}$. Thus

    $\sup\{|\tau(FT)| : F \in \mathcal{F},\ \|F\|_p \le 1\} \le \|f\| < \infty,$

and by Lemma 2.3.7 we have $T \in \mathcal{C}_q$. Finally, $f(F) = \tau(FT) = f_T(F)$ for each $F$ in $\mathcal{F}$ and, since $\mathcal{F}$ is dense in $\mathcal{C}_p$, it follows by continuity of $f$ and $f_T$ that $f = f_T$. This completes the proof of the theorem.


It follows easily from Theorem 2.3.12 that the Banach space $\mathcal{C}_p$ is reflexive if $1 < p < \infty$. For $(\mathcal{C}_p)^*$ can be identified with $\mathcal{C}_q$, and $(\mathcal{C}_q)^*$ can be identified with $\mathcal{C}_p$; these identifications give rise to an isometric isomorphism from $\mathcal{C}_p$ onto $(\mathcal{C}_p)^{**}$, and a routine manipulation shows that this isomorphism is the canonical one described in §1.1.


THEOREM 2.3.13. Suppose that $1 \le p \le \infty$, $q$ is the index conjugate to $p$, $S \in \mathcal{C}_p$ and $T \in \mathcal{C}_q$. Then $ST, TS \in \mathcal{C}_1$ and $\tau(ST) = \tau(TS)$.

Proof. Since the roles of $p$ and $q$ can be reversed we may assume that $p \ne \infty$. Then $\mathcal{F}$ is dense in $\mathcal{C}_p$ and so, given any positive $\varepsilon$, there exists $F$ in $\mathcal{F}$ for which $\|S - F\|_p < \varepsilon$. By Theorem 2.3.10 and Lemma 2.3.3 we have $ST, TS \in \mathcal{C}_1$,

    $|\tau(ST - FT)| \le \|ST - FT\|_1 \le \|S - F\|_p \|T\|_q < \varepsilon\|T\|_q,$

and similarly $|\tau(TS - TF)| < \varepsilon\|T\|_q$. By Theorem 2.2.4 (iv), $\tau(FT) = \tau(TF)$; hence $|\tau(ST) - \tau(TS)| < 2\varepsilon\|T\|_q$ and, since $\varepsilon$ is arbitrary, $\tau(ST) = \tau(TS)$.

The most important cases of Theorem 2.3.13 are those in which $p = 1$ and $q = \infty$ (already included in Theorem 2.2.4) or $p = q = 2$.


2.4. The Schmidt class

DEFINITION 2.4.1. The ideal $\mathcal{C}_2$ in $\mathcal{B}(\mathcal{H})$ is called the Schmidt class of operators on $\mathcal{H}$.

We recall that $\mathcal{C}_2$ is a Banach space with respect to the norm $\|\cdot\|_2$ described in Definition 2.3.2. The following theorem shows that $\mathcal{C}_2$ is, in fact, a Hilbert space, and has an orthonormal basis consisting of operators of the form $x \otimes y$ (the notation being that introduced at the end of §1.9).


THEOREM 2.4.2. The Schmidt class C is a Hilbert space 
with respect to the inner product | , | defined by [S, T] =7(T*S) for 
each § and T in Co. The norm derived from this inner product 1s 
IRIRI 

If io, : jeJ} and Ww, : j € J} are orthonormal bases in H and 

J 


Mk =9, 8 wy, then Fs g : j, k € J} is an orthonormal basis in Cx 


Proof. It is clear that [S, T] is linear in S and that its conjugate 
complex number is $[T, S]$. If $VH$ is the polar decomposition of an element $T$ of $C_2$ then

$$[T, T] = \tau(T^*T) = \tau(H^2) = \|T\|_2^2.$$

Since $\|\cdot\|_2$ is a norm under which $C_2$ is complete, it follows that $C_2$ is a Hilbert space with $[\ ,\ ]$ as inner product.

Since

$$F_{j,k}^* F_{l,m} = (\psi_j \otimes \varphi_k)(\varphi_m \otimes \psi_l) = \langle \varphi_m, \varphi_k \rangle\, (\psi_j \otimes \psi_l),$$

we have

$$[F_{l,m}, F_{j,k}] = \tau(F_{j,k}^* F_{l,m}) = \langle \varphi_m, \varphi_k \rangle \langle \psi_j, \psi_l \rangle = \begin{cases} 1 & \text{if } l = j \text{ and } m = k, \\ 0 & \text{otherwise.} \end{cases}$$

Hence $\{F_{j,k} : j, k \in J\}$ is an orthonormal system in $C_2$. Note that, if $T \in C_2$ then

$$[T, F_{j,k}] = \tau(F_{j,k}^* T) = \tau(\psi_j \otimes T^*\varphi_k) = \langle T\psi_j, \varphi_k \rangle.$$

If $T$ is orthogonal to each $F_{j,k}$ then $\langle T\psi_j, \varphi_k \rangle = 0$ for all $j, k$ in $J$; since $T$ is continuous, and finite linear combinations of $\psi_j$'s, or of $\varphi_k$'s, are everywhere dense in $H$, it follows that $\langle Tx, y \rangle = 0$ for each $x$ and $y$ in $H$ and hence that $T = 0$. Thus $\{F_{j,k} : j, k \in J\}$ is an orthonormal basis in $C_2$.

THEOREM 2.4.3. Suppose that $T \in B(H)$, $\{\varphi_j : j \in J\}$ and $\{\psi_j : j \in J\}$ are orthonormal bases in $H$, and $F_{j,k} = \varphi_k \otimes \psi_j$. Then the following three conditions are equivalent.

(i) $\sum_{j \in J} \|T\psi_j\|^2 < \infty$,

(ii) $\sum_{j,k \in J} |\langle T\psi_j, \varphi_k \rangle|^2 < \infty$,

(iii) $T \in C_2$.

When these conditions are satisfied the sums occurring in (i) and (ii) are both equal to $\|T\|_2^2$, and

$$T = \sum_{j,k \in J} \langle T\psi_j, \varphi_k \rangle F_{j,k},$$

the summation converging with respect to $\|\cdot\|_2$.

Proof. Since $\|T\psi_j\|^2 = \sum_{k \in J} |\langle T\psi_j, \varphi_k \rangle|^2$ it follows that

$$\sum_{j \in J} \|T\psi_j\|^2 = \sum_{j,k \in J} |\langle T\psi_j, \varphi_k \rangle|^2 \tag{1}$$

(these sums being finite or infinite), so (i) and (ii) are equivalent.

If $T \in C_2$ then, since $\{F_{j,k} : j, k \in J\}$ is an orthonormal basis in $C_2$,

$$\|T\|_2^2 = \sum_{j,k \in J} |[T, F_{j,k}]|^2 = \sum_{j,k \in J} |\langle T\psi_j, \varphi_k \rangle|^2, \tag{2}$$

$$T = \sum_{j,k \in J} [T, F_{j,k}] F_{j,k} = \sum_{j,k \in J} \langle T\psi_j, \varphi_k \rangle F_{j,k}, \tag{3}$$

where the summations in (3) converge with respect to $\|\cdot\|_2$; in particular, it follows from (2) that (ii) is satisfied, so (iii) implies (ii).

Suppose conversely that $T \in B(H)$ and (ii) is satisfied. With $c_{j,k} = \langle T\psi_j, \varphi_k \rangle$, we have $\sum |c_{j,k}|^2 < \infty$, hence $\sum c_{j,k} F_{j,k}$ converges with respect to $\|\cdot\|_2$ to an element $S$ of $C_2$, and $[S, F_{j,k}] = c_{j,k}$. Since

$$\langle S\psi_j, \varphi_k \rangle = [S, F_{j,k}] = c_{j,k} = \langle T\psi_j, \varphi_k \rangle$$

for each $j$ and $k$ in $J$, it follows that $T = S \in C_2$. Hence (ii) implies (iii).

We have now shown that conditions (i), (ii) and (iii) are equivalent and that, given these conditions, equations (1), (2) and (3) are satisfied. This completes the proof of the theorem.

It follows at once from Theorem 2.4.3 that, if $T \in B(H)$ and either (i) or (ii) is satisfied for one particular choice of the orthonormal bases involved, then both (i) and (ii) are satisfied for any choice of these orthonormal bases.
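The basis independence asserted in Theorem 2.4.3 can be illustrated in finite dimensions, where every operator lies in the Schmidt class. The following is only a sketch: the matrix `T`, the two bases and the helper name `hs_sum` are choices made for this illustration and do not come from the text.

```python
import math

def hs_sum(T, psi, phi):
    """Return the sum over j, k of |<T psi_j, phi_k>|^2 for a square matrix T."""
    n = len(T)
    total = 0.0
    for p in psi:
        # Apply T to the basis vector p.
        Tp = [sum(T[r][c] * p[c] for c in range(n)) for r in range(n)]
        for q in phi:
            # Inner product <Tp, q>, conjugate-linear in the second argument.
            inner = sum(Tp[r] * complex(q[r]).conjugate() for r in range(n))
            total += abs(inner) ** 2
    return total

# Illustrative operator on C^2 (an assumption, not taken from the text).
T = [[1 + 2j, 3], [0, -1j]]

# Three orthonormal bases of C^2: standard, a real rotation, a complex unitary.
standard = [[1, 0], [0, 1]]
c, s = math.cos(0.7), math.sin(0.7)
rotated = [[c, s], [-s, c]]
r = 1 / math.sqrt(2)
unitary = [[r, 1j * r], [1j * r, r]]

# Sum of squared moduli of the matrix entries, i.e. the square of the norm in C_2.
frobenius = sum(abs(x) ** 2 for row in T for x in row)
print(frobenius, hs_sum(T, standard, standard), hs_sum(T, rotated, unitary))
```

All three printed numbers agree up to rounding, whichever pair of orthonormal bases is used in condition (ii).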

We now give a simple description of the Schmidt class of operators on an $L_2$ space.

THEOREM 2.4.4. Let $H = L_2(E, \mu)$ and, for each $h$ in $L_2(E \times E, \mu \times \mu)$, let $T_h$ be the associated integral operator on $H$ defined by

$$(T_h f)(s) = \int_E h(s, t) f(t)\, d\mu(t).$$

Then the mapping $h \to T_h$ is an isometric isomorphism from $L_2(E \times E, \mu \times \mu)$ onto $C_2$.


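Proof. Let $\{\varphi_j : j \in J\}$ be an orthonormal basis in $L_2(E, \mu)$. We may define an orthonormal system $\{\psi_{j,k} : j, k \in J\}$ in $L_2(E \times E, \mu \times \mu)$ by $\psi_{j,k}(s, t) = \varphi_j(s)\overline{\varphi_k(t)}$, and for each $h$ in $L_2(E \times E, \mu \times \mu)$ we have

$$\langle T_h \varphi_k, \varphi_j \rangle = \iint h(s, t)\, \varphi_k(t)\, \overline{\varphi_j(s)}\, d\mu(t)\, d\mu(s) = \langle h, \psi_{j,k} \rangle.$$

It follows that, if $\langle h, \psi_{j,k} \rangle = 0$ for each $j$ and $k$ in $J$, then $T_h = 0$ and hence (see §1.8) $h(s, t) = 0$ almost everywhere on $E \times E$. Thus $\{\psi_{j,k} : j, k \in J\}$ is an orthonormal basis of $L_2(E \times E, \mu \times \mu)$ and, for each $h$ in $L_2(E \times E, \mu \times \mu)$,

$$\|h\|^2 = \sum_{j,k \in J} |\langle h, \psi_{j,k} \rangle|^2 = \sum_{j,k \in J} |\langle T_h \varphi_k, \varphi_j \rangle|^2.$$

By Theorem 2.4.3, $T_h \in C_2$ and $\|T_h\|_2 = \|h\|$. It follows that the mapping $h \to T_h$ is an isometric isomorphism from $L_2(E \times E, \mu \times \mu)$ onto a subspace $M$ of $C_2$. Since $L_2(E \times E, \mu \times \mu)$ is complete, the same is true of $M$, and thus $M$ is a closed subspace of $C_2$. It is easily verified that $T_h = \varphi_j \otimes \varphi_k$ when $h = \psi_{j,k}$, so $\varphi_j \otimes \varphi_k \in M$. By Theorem 2.4.2, $\{\varphi_j \otimes \varphi_k : j, k \in J\}$ is an orthonormal basis of $C_2$, so $M = C_2$.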

We now revert to the study of the Schmidt class of operators on 


a general Hilbert space. 


THEOREM 2.4.5. Suppose that $T \in C_2$ and $(\lambda_n)$ is the sequence of non-zero eigenvalues of $T$, counted according to their algebraic multiplicities. Then

$$\sum_n |\lambda_n|^2 \le \|T\|_2^2; \tag{4}$$

equality occurs if and only if $T$ is normal.


Proof. The inequality (4) is a special case of Corollary 2.3.6. However, in order to determine the conditions under which equality occurs, we need a different proof of (4). By Theorem 1.8.6 there is an orthonormal sequence $(\psi_n)$ in $H$ such that $\langle T\psi_n, \psi_n \rangle = \lambda_n$. Let $\{\varphi_j : j \in J\}$ be an orthonormal basis of $H$ which contains each $\psi_n$, and define $K = \{j \in J : \varphi_j \text{ is one of the } \psi_n\text{'s}\}$. Then

$$\sum_n |\lambda_n|^2 = \sum_n |\langle T\psi_n, \psi_n \rangle|^2 = \sum_{k \in K} |\langle T\varphi_k, \varphi_k \rangle|^2 \le \sum_{j,k \in J} |\langle T\varphi_k, \varphi_j \rangle|^2.$$

By Theorem 2.4.3 the last quantity is $\|T\|_2^2$, so (4) is satisfied, and equality occurs in (4) if and only if

$$\langle T\varphi_k, \varphi_j \rangle = 0 \quad \text{unless } j = k \in K. \tag{5}$$

If equality occurs in (4), we deduce from (5) and Theorem 2.4.3 that

$$T = \sum_{k \in K} \langle T\varphi_k, \varphi_k \rangle\, (\varphi_k \otimes \varphi_k),$$

the series converging with respect to $\|\cdot\|_2$, and so also with respect to $\|\cdot\|$. Hence

$$Tx = \sum_n \lambda_n \langle x, \psi_n \rangle \psi_n \qquad (x \in H), \tag{6}$$

and $T$ is normal.

Suppose conversely that $T$ is normal (and, of course, compact, since $T \in C_2$). We can assume that the orthonormal sequence $(\psi_n)$ was chosen, at the outset, so that (6) is satisfied, since this implies also that $\langle T\psi_n, \psi_n \rangle = \lambda_n$. It follows easily from (6) that (5) is satisfied and hence that equality occurs in (4).
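A two-dimensional example makes the role of normality in Theorem 2.4.5 concrete. The sketch below is illustrative only: the two matrices and the helper names `eigenvalues_2x2` and `schur_gap` are assumptions introduced here. It compares the sum of $|\lambda_n|^2$ with the squared Schmidt norm for a non-normal and a normal matrix.

```python
import cmath

def eigenvalues_2x2(a):
    """Eigenvalues of a 2x2 complex matrix from its characteristic polynomial."""
    tr = a[0][0] + a[1][1]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    root = cmath.sqrt(tr * tr - 4 * det)
    return [(tr + root) / 2, (tr - root) / 2]

def schur_gap(a):
    """Squared Schmidt norm minus the sum of |eigenvalue|^2; non-negative by (4)."""
    hs_squared = sum(abs(x) ** 2 for row in a for x in row)
    eig_squared = sum(abs(lam) ** 2 for lam in eigenvalues_2x2(a))
    return hs_squared - eig_squared

non_normal = [[1, 1], [0, 2]]   # upper triangular and not normal
normal = [[0, 1], [-1, 0]]      # a rotation through a right angle, hence normal
print(schur_gap(non_normal), schur_gap(normal))
```

The gap is strictly positive for the triangular matrix and vanishes (up to rounding) for the rotation, in line with the equality condition of the theorem.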


The following result is an immediate consequence of Theorems 2.4.4 and 2.4.5. The first part is usually known as Schur’s inequality, and the second part is due to Goldfain [23].

COROLLARY 2.4.6. Suppose that $h \in L_2(E \times E, \mu \times \mu)$ and that $T_h$ is the associated integral operator on $L_2(E, \mu)$ defined by

$$(T_h f)(s) = \int_E h(s, t) f(t)\, d\mu(t).$$

Let $(\lambda_n)$ be the sequence of non-zero eigenvalues of $T_h$, counted according to their algebraic multiplicities. Then

(i) $\sum_n |\lambda_n|^2 \le \iint |h(s, t)|^2\, d\mu(s)\, d\mu(t)$;

(ii) equality occurs in (i) if and only if $T_h$ is normal.


In view of Theorem 2.4.4, it is natural to ask for a simple description of the trace class operators on an $L_2$ space, but it seems that no such characterization exists. Suppose that $k(s, t)$ is continuous on the square $[0, 1] \times [0, 1]$, the associated integral operator $K$ on $L_2(0, 1)$ (Lebesgue measure) is self-adjoint, $(\lambda_n)$ is the sequence of non-zero eigenvalues of $K$ (counted according to their multiplicities) and $(\varphi_n)$ is the corresponding orthonormal sequence of eigenfunctions. It is not difficult to show that each $\varphi_n$ is continuous [58: p. 22]. If $K$ is a positive operator (in the sense of §1.7), a classical theorem of Mercer ([48]; see also [58: p. 128]) asserts that

$$k(s, t) = \sum_n \lambda_n \varphi_n(s) \overline{\varphi_n(t)},$$

the series being uniformly convergent for $s, t$ in $[0, 1]$. Thus

$$\sum_n \lambda_n = \sum_n \int_0^1 \lambda_n |\varphi_n(s)|^2\, ds = \int_0^1 k(s, s)\, ds < \infty.$$

It follows that $K$ lies in the trace class (but this conclusion fails, in general, if we omit the assumption that $K$ is positive).


CHAPTER 3 
Fredholm Theory for Trace Class Operators 


3.1 Introduction 
The classical Fredholm theory is concerned with the solution of integral equations of the form

$$f(s) - \lambda \int_a^b h(s, t) f(t)\, dt = g(s) \qquad (a \le s \le b), \tag{1}$$

where $\lambda$ is a complex number, $[a, b]$ is a compact real interval, and the known functions $g$, $h$ and the unknown function $f$ are all required to be continuous. For a full account of this subject, we refer to [43]; in this section we are concerned only to describe certain general features of the theory. The solution of equation (1) is expressed in terms of an entire function

$$d(\lambda) = 1 + \sum_{n=1}^{\infty} d_n \lambda^n, \tag{2}$$

called the Fredholm determinant of the kernel $h$, and a kernel

$$D_\lambda(s, t) = \sum_{n=0}^{\infty} D_n(s, t) \lambda^n, \tag{3}$$

the first Fredholm minor. These can be obtained, heuristically, by considering (1) as the limiting case of a finite system of linear equations in a finite number of unknowns; the functions $d(\lambda)$ and $D_\lambda(s, t)$ are the limits of certain determinants which arise when this system is solved by Cramer’s rule. The series in (3) converges for all $s, t$ in $[a, b]$ and all complex $\lambda$, uniformly when $\lambda$ is confined to a bounded subset of the complex plane. When $d(\lambda) \ne 0$, equation (1) has a unique solution given by

$$f(s) = g(s) + \frac{\lambda}{d(\lambda)} \int_a^b D_\lambda(s, t) g(t)\, dt.$$

If $d(\lambda) = 0$ there is a non-trivial solution of the homogeneous equation

$$f(s) - \lambda \int_a^b h(s, t) f(t)\, dt = 0 \qquad (a \le s \le b);$$

the conditions under which (1) has a solution, and the form of the general solution when it exists, can then be expressed in terms of Fredholm minors of higher orders.

There are explicit formulae for $d_n$ and $D_n(s, t)$ in terms of $h$; for example,

$$d_1 = -\int_a^b h(s, s)\, ds. \tag{4}$$

By analogy with matrix theory the integral in (4) can be regarded as the ‘trace’ of the kernel $h$. It turns out that each of the coefficients $d_n$ can be expressed in terms of such traces of $h$ and its iterated kernels and that there is a similar expression, involving both the kernels and their traces, for $D_n$. These formulae for $d_n$ and $D_n$ were given by Plemelj [50]. By use of closely analogous formulae we shall introduce the Fredholm determinant and the first Fredholm minor of a trace class operator in §3.3. We hasten to add that, if $T_h$ is the integral operator in $L_2([a, b])$ associated with a continuous kernel $h$, then $T_h$ need not be in the trace class.




If $f$, $g$ and $h$ are required only to be of class $L_2$, the theory just described requires modification; for example, the function $h(s, s)$ can fail to be integrable and equation (4) is then meaningless. Carleman [11] showed that, if $h$ is redefined to be zero at all points of the form $(s, s)$, then the classical Fredholm formulae remain valid; a simpler proof of this result was given by Smithies [57; see also 58]. Appropriately modified versions of Plemelj’s formulae for $d_n$ and $D_n$ play an essential part in the method used by Smithies. In both the classical Fredholm theory and its modification for $L_2$ functions, it is possible to obtain estimates for the rates of growth of $d(\lambda)$ and $D_\lambda$ as $|\lambda| \to \infty$.

Our purpose in this chapter is to provide a Fredholm theory for equations of the form $x - \lambda Tx = y$, where $T$ lies in the trace class of operators on a Hilbert space $H$ and $x, y \in H$. Such a theory, valid when $T$ lies in an arbitrary von Neumann-Schatten class, has been expounded by Dunford and Schwartz [16: XI.9]. Although the Fredholm formulae given in this chapter apply only to trace class operators, they can be used to obtain sharp estimates for the rate of growth of the resolvent of a quasi-nilpotent operator in a general von Neumann-Schatten class (see Theorem 3.4.6). Our method is a modification of the one used by Smithies [57, 58].

We note that a Fredholm theory for trace class operators on a Banach space has been developed by Ruston [53, 54], and that Grothendieck obtained similar results for trace class operators acting on locally convex spaces [24]. A sharp estimate for the rate of growth of the Fredholm determinant of a trace class operator acting on a Hilbert space was given by Lidskii [39].


3.2. Fredholm formulae for operators of finite rank 


In this section we develop a Fredholm theory for an operator of finite rank on a Hilbert space $H$. In §3.3 our results are extended, so as to apply to a trace class operator, by an approximation process.

We introduce some notation which will be used throughout §3.2. Let $T$ be the operator defined by

$$Tx = \sum_{j=1}^{n} \mu_j \langle x, \varphi_j \rangle \psi_j \qquad (x \in H), \tag{1}$$

where $(\varphi_1, \ldots, \varphi_n)$ and $(\psi_1, \ldots, \psi_n)$ are orthonormal systems in $H$ and $\mu_1, \ldots, \mu_n$ are positive real numbers. The discussion at the end of §1.9 shows that each operator of finite rank can be represented in this way. We denote by $M$ the subspace generated by $\varphi_1, \ldots, \varphi_n, \psi_1, \ldots, \psi_n$. Since $Tx$ is a linear combination of $\psi_1, \ldots, \psi_n$ for each $x$ in $H$, and $Ty = 0$ if $y$ is orthogonal to each of $\varphi_1, \ldots, \varphi_n$, it follows that

$$Tx \in M \quad (x \in H), \qquad Ty = 0 \quad (y \in M^\perp); \tag{2}$$

we denote by $T^M$ the restriction of $T$ to $M$. We extend $(\varphi_1, \ldots, \varphi_n)$ to an orthonormal basis $(\varphi_1, \ldots, \varphi_k)$ of $M$, and define a $k \times k$ matrix $A = [a_{ij}]$ by the condition

$$T\varphi_j = \sum_{i=1}^{k} a_{ij} \varphi_i \qquad (j = 1, \ldots, k). \tag{3}$$

Thus $A$ is the matrix of the operator $T^M$ with respect to the basis $(\varphi_1, \ldots, \varphi_k)$. Since

$$\|T\varphi_j\|^2 = \sum_{i=1}^{k} |a_{ij}|^2$$

and, by (1), $T\varphi_j = \mu_j \psi_j$ or $0$ according as $j \le n$ or $j > n$, it follows that

$$\sum_{i=1}^{k} |a_{ij}|^2 = \begin{cases} \mu_j^2 & (1 \le j \le n), \\ 0 & (n < j \le k). \end{cases} \tag{4}$$


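For each complex number $\lambda$ we define

$$d(\lambda) = \det(I_k - \lambda A), \tag{5}$$

the determinant of $I_k - \lambda A$, where $I_k$ is the $k \times k$ identity matrix. The transposed cofactor matrix of $I_k - \lambda A$ will be denoted by $B(\lambda) = [b_{ij}(\lambda)]$, so that $b_{ij}(\lambda)$ is the $(j, i)$ cofactor of $I_k - \lambda A$ and, by elementary matrix theory,

$$(I_k - \lambda A) B(\lambda) = B(\lambda)(I_k - \lambda A) = d(\lambda) I_k. \tag{6}$$

We introduce a $k \times k$ matrix $C(\lambda) = [c_{ij}(\lambda)]$ and an operator $D_\lambda$ on $H$ by the equations

$$C(\lambda) = A B(\lambda), \tag{7}$$

$$D_\lambda x = \sum_{i,j=1}^{k} c_{ij}(\lambda) \langle x, \varphi_j \rangle \varphi_i \qquad (x \in H). \tag{8}$$

Then

$$D_\lambda \varphi_j = \sum_{i=1}^{k} c_{ij}(\lambda) \varphi_i,$$

and $D_\lambda y = 0$ whenever $y$ is orthogonal to each of $\varphi_1, \ldots, \varphi_k$. It follows that

$$D_\lambda x \in M \quad (x \in H), \qquad D_\lambda y = 0 \quad (y \in M^\perp), \tag{9}$$

and that the restriction $D_\lambda^M$ of $D_\lambda$ to $M$ is the operator whose matrix with respect to the basis $\varphi_1, \ldots, \varphi_k$ is $C(\lambda)$.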
Although $d(\lambda)$ and $D_\lambda$ are polynomials in $\lambda$, the latter having operator-valued coefficients, it is more convenient for our purposes to express them as power series by inserting additional terms with zero coefficients. We therefore write

$$d(\lambda) = \sum_{m=0}^{\infty} d_m \lambda^m, \qquad D_\lambda = \sum_{m=0}^{\infty} \lambda^m D_m. \tag{10}$$

LEMMA 3.2.1. For each complex number $\lambda$

$$(d(\lambda) I + \lambda D_\lambda)(I - \lambda T) = (I - \lambda T)(d(\lambda) I + \lambda D_\lambda) = d(\lambda) I. \tag{11}$$

Proof. It follows at once from (2) and (9) that each of the operators occurring in (11) leaves invariant the subspaces $M$ and $M^\perp$, and that

$$(d(\lambda) I + \lambda D_\lambda)(I - \lambda T) y = (I - \lambda T)(d(\lambda) I + \lambda D_\lambda) y = d(\lambda) y$$

when $y \in M^\perp$. It remains to prove the equations obtained from (11) by restricting each operator to $M$. For this purpose it is sufficient to derive the corresponding relations

$$(d(\lambda) I_k + \lambda C(\lambda))(I_k - \lambda A) = (I_k - \lambda A)(d(\lambda) I_k + \lambda C(\lambda)) = d(\lambda) I_k \tag{12}$$

for the matrices of these (restricted) operators with respect to the basis $\varphi_1, \ldots, \varphi_k$. By (6)

$$d(\lambda) I_k + \lambda C(\lambda) = d(\lambda) I_k + \lambda A B(\lambda) = B(\lambda);$$

and this, together with another application of (6), yields (12).
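Lemma 3.2.1 can be checked directly on a small matrix. The sketch below works in exact rational arithmetic with a 2 x 2 matrix standing in for $A$; the matrix, the value of $\lambda$ and the helper names `matmul` and `fredholm_inverse` are assumptions made for this illustration. It forms $d(\lambda)$, the transposed cofactor matrix $B(\lambda)$ of (6) and $C(\lambda) = AB(\lambda)$ of (7), and confirms that $I_k + \lambda d(\lambda)^{-1} C(\lambda)$ inverts $I_k - \lambda A$, which is (12) divided by $d(\lambda)$.

```python
from fractions import Fraction

def matmul(x, y):
    """Product of two 2x2 matrices given as lists of rows."""
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def fredholm_inverse(a, lam):
    """Return (d, n, inverse): d = det(n), n = I - lam*a, inverse = I + (lam/d)*C."""
    n = [[(1 if i == j else 0) - lam * a[i][j] for j in range(2)]
         for i in range(2)]
    d = n[0][0] * n[1][1] - n[0][1] * n[1][0]
    # Transposed cofactor matrix B(lam) of I - lam*a, as in (6).
    b = [[n[1][1], -n[0][1]], [-n[1][0], n[0][0]]]
    # C(lam) = A B(lam), as in (7).
    c = matmul(a, b)
    inverse = [[(1 if i == j else 0) + lam * c[i][j] / d for j in range(2)]
               for i in range(2)]
    return d, n, inverse

# Illustrative data (assumptions): a 2x2 matrix and a value of lam with d != 0.
A = [[Fraction(2), Fraction(1)], [Fraction(0), Fraction(3)]]
lam = Fraction(1, 10)
d, N, inv = fredholm_inverse(A, lam)
print(d, matmul(N, inv), matmul(inv, N))
```

Both products are the identity matrix exactly, so no rounding tolerance is needed; the formula breaks down only when $d(\lambda) = 0$.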
The significance of Lemma 3.2.1 is that, when $d(\lambda) \ne 0$, $I - \lambda T$ has an inverse given by

$$(I - \lambda T)^{-1} = I + \lambda [d(\lambda)]^{-1} D_\lambda.$$

Our next objective is to obtain expressions for $d_m$ and $D_m$, the coefficients in the expansions (10) of $d(\lambda)$ and $D_\lambda$, in a form which remains meaningful when $T$ is replaced by a general trace class operator. For this purpose we need some further notation and an auxiliary result. Let

$$\sigma_r = \tau(T^r) \qquad (r = 1, 2, \ldots), \tag{13}$$

and suppose that the expression for the polynomial $d(\lambda) = \det(I_k - \lambda A)$ as a product of linear factors is

$$d(\lambda) = (1 - \lambda\theta_1)(1 - \lambda\theta_2) \cdots (1 - \lambda\theta_k). \tag{14}$$

Thus $\theta_1, \ldots, \theta_k$ are the eigenvalues of the matrix $A$, counted according to their algebraic multiplicities.


LEMMA 3.2.2. $\sigma_r = \theta_1^r + \theta_2^r + \cdots + \theta_k^r$ $(r = 1, 2, \ldots)$.

Proof. Suppose that $S$ is any operator on $H$ which leaves $M$ invariant and annihilates $M^\perp$; and let $P = [p_{ij}]$ be the matrix, with respect to the basis $\varphi_1, \ldots, \varphi_k$, of the operator $S^M$ obtained by restricting $S$ to $M$. Let $\{\xi_j : j \in J\}$ be an orthonormal basis of $H$ which contains each of $\varphi_1, \ldots, \varphi_k$. Then $S$ has finite rank and so lies in the trace class, $S\xi_j = 0$ except when $\xi_j$ is one of $\varphi_1, \ldots, \varphi_k$, and therefore

$$\tau(S) = \sum_{j \in J} \langle S\xi_j, \xi_j \rangle = \sum_{i=1}^{k} \langle S\varphi_i, \varphi_i \rangle = \sum_{i=1}^{k} p_{ii}.$$

It follows from elementary matrix theory that $\tau(S)$ is the sum of the eigenvalues of $P$. If we take $S = T^r$ then $S^M$ is $(T^M)^r$ and its matrix $P$ is $A^r$. The eigenvalues of $P$, counted according to their algebraic multiplicities, are $\theta_1^r, \theta_2^r, \ldots, \theta_k^r$, and so

$$\tau(T^r) = \theta_1^r + \theta_2^r + \cdots + \theta_k^r.$$


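LEMMA 3.2.3. The coefficients in the expansion (10) of $d(\lambda)$ are given by $d_0 = 1$ and

$$d_m = \frac{(-1)^m}{m!} \begin{vmatrix} \sigma_1 & 1 & 0 & \cdots & 0 \\ \sigma_2 & \sigma_1 & 2 & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ \sigma_{m-1} & \sigma_{m-2} & \sigma_{m-3} & \cdots & m-1 \\ \sigma_m & \sigma_{m-1} & \sigma_{m-2} & \cdots & \sigma_1 \end{vmatrix} \qquad (m \ge 1). \tag{15}$$

Proof. By differentiating (14) with respect to $\lambda$ we obtain

$$\frac{d'(\lambda)}{d(\lambda)} = -\sum_{j=1}^{k} \frac{\theta_j}{1 - \lambda\theta_j} = -\sum_{j=1}^{k} \sum_{m=0}^{\infty} \lambda^m \theta_j^{m+1} = -\sum_{m=0}^{\infty} \sigma_{m+1} \lambda^m,$$

this expansion being valid for sufficiently small $\lambda$. Since

$$d(\lambda) = \sum_{m=0}^{\infty} \lambda^m d_m,$$

we have

$$\sum_{m=1}^{\infty} m \lambda^{m-1} d_m = d'(\lambda) = -\Big( \sum_{m=0}^{\infty} \lambda^m d_m \Big) \Big( \sum_{m=0}^{\infty} \sigma_{m+1} \lambda^m \Big).$$

By equating coefficients of $\lambda^{r-1}$ we obtain

$$r d_r = -(d_0 \sigma_r + d_1 \sigma_{r-1} + \cdots + d_{r-1} \sigma_1).$$

Hence

$$\begin{aligned} d_1 &= -d_0 \sigma_1, \\ d_1 \sigma_1 + 2 d_2 &= -d_0 \sigma_2, \\ d_1 \sigma_2 + d_2 \sigma_1 + 3 d_3 &= -d_0 \sigma_3, \\ &\;\;\vdots \\ d_1 \sigma_{m-1} + d_2 \sigma_{m-2} + \cdots + d_{m-1} \sigma_1 + m d_m &= -d_0 \sigma_m. \end{aligned} \tag{16}$$

It follows from (5) that $d_0 = d(0) = 1$. By applying Cramer’s rule to the system (16) of linear equations in $d_1, \ldots, d_m$ we obtain (15).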
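In practice the triangular system (16) is solved by forward substitution rather than by expanding the determinant (15). The following sketch does this for a small matrix in exact arithmetic; the matrix and the helper names `power_traces` and `fredholm_coefficients` are assumptions made for the illustration.

```python
from fractions import Fraction

def matmul(x, y):
    """Product of two square matrices given as lists of rows."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def power_traces(a, count):
    """Return [sigma_1, ..., sigma_count] with sigma_r = trace(a^r), as in (13)."""
    traces, power = [], a
    for _ in range(count):
        traces.append(sum(power[i][i] for i in range(len(a))))
        power = matmul(power, a)
    return traces

def fredholm_coefficients(a, count):
    """Return [d_0, ..., d_count] from r*d_r = -(d_0 s_r + ... + d_{r-1} s_1)."""
    sigma = power_traces(a, count)
    d = [Fraction(1)]
    for r in range(1, count + 1):
        # sigma[r - i - 1] holds sigma_{r-i}.
        total = sum(d[i] * sigma[r - i - 1] for i in range(r))
        d.append(-total / r)
    return d

# Illustrative matrix (an assumption): det(I - lam*A) = (1 - 2 lam)(1 - 3 lam).
A = [[2, 1], [0, 3]]
print(fredholm_coefficients(A, 3))
```

For this matrix the coefficients are $1, -5, 6, 0$, matching the expansion $1 - 5\lambda + 6\lambda^2$ of the determinant; the vanishing of $d_3$ reflects the fact that $d(\lambda)$ is a polynomial of degree at most $k$.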


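LEMMA 3.2.4. The coefficients in the power series (10) for $D_\lambda$ are given by $D_0 = T$ and

$$D_m = \frac{(-1)^m}{m!} \begin{vmatrix} \sigma_1 & 1 & 0 & \cdots & 0 & 0 \\ \sigma_2 & \sigma_1 & 2 & \cdots & 0 & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ \sigma_m & \sigma_{m-1} & \sigma_{m-2} & \cdots & \sigma_1 & m \\ T^{m+1} & T^m & T^{m-1} & \cdots & T^2 & T \end{vmatrix} \qquad (m \ge 1), \tag{17}$$

where the determinant is to be interpreted as the operator obtained by formal expansion.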


Proof. We begin by remarking that, because of the uniqueness assertion in Theorem 1.4.1, the process of ‘equating coefficients’ is valid for power series with operator-valued coefficients (in proving the present lemma we shall in fact be dealing with polynomials). It follows from (11) that

$$D_\lambda = \lambda T D_\lambda + d(\lambda) T,$$

and by equating coefficients of powers of $\lambda$ we deduce that

$$D_0 = T, \qquad D_m = T D_{m-1} + d_m T \quad (m \ge 1). \tag{18}$$

The recurrence relation (18) determines $D_m$ for all $m = 0, 1, 2, \ldots$ Let $E_m$ denote the right-hand side of (17) when $m \ge 1$, and define $E_0 = T$. If we expand the determinant in (17) by its last column we find that

$$E_m = T E_{m-1} + d_m T \qquad (m \ge 1).$$

Hence $E_m$, as well as $D_m$, satisfies (18); and therefore

$$D_m = E_m \qquad (m = 0, 1, 2, \ldots).$$

In §3.3 we shall use equations (13), (15), (17) and (10), in that order, to define the Fredholm determinant $d(\lambda)$ and the first Fredholm minor $D_\lambda$ for a general trace class operator $T$. In that context, $d(\lambda)$ and $D_\lambda$ need not be polynomials; and, in order to show that they are entire functions, we require estimates for the orders of magnitude of the coefficients $d_m$ and $D_m$. We shall also be interested in the rates of growth of these functions as $|\lambda| \to \infty$. For the present we continue studying the case in which $T$ has finite rank, and in Lemmas 3.2.6, 3.2.7 and 3.2.8 we obtain the appropriate estimates in this case.

LEMMA 3.2.5 (Hadamard’s inequality). Let $P = [p_{ij}]$ be a $k \times k$ complex matrix. Then

$$|\det P|^2 \le \prod_{j=1}^{k} \Big( \sum_{i=1}^{k} |p_{ij}|^2 \Big).$$

Proof. The result is apparent if $\det P = 0$, so we assume henceforth that $P$ is non-singular. The ‘columns’ $v_j = (p_{1j}, p_{2j}, \ldots, p_{kj})$ $(j = 1, \ldots, k)$ of $P$ form a basis in the Hilbert space $\mathbb{C}^k$ of $k$-tuples of complex numbers. By applying the Schmidt orthogonalization process to $v_1, \ldots, v_k$ we obtain an orthonormal basis $w_1, \ldots, w_k$ of $\mathbb{C}^k$ with the property that

$$v_j = \sum_{r=1}^{j} t_{rj} w_r \qquad (j = 1, \ldots, k) \tag{19}$$

for suitable scalars $t_{rj}$ $(1 \le r \le j \le k)$. If $w_r$ is the vector $(u_{1r}, \ldots, u_{kr})$ and we define $t_{rj} = 0$ when $r > j$, then (19) asserts that

$$p_{ij} = \sum_{r=1}^{k} u_{ir} t_{rj} \qquad (i, j = 1, \ldots, k).$$

Hence $P = UT$, where $U$ is the unitary $k \times k$ matrix $[u_{ij}]$ and $T$ is the upper-triangular $k \times k$ matrix $[t_{ij}]$. Since $T = U^*P$ we have

$$|t_{jj}|^2 \le \sum_{i=1}^{k} |t_{ij}|^2 = \sum_{i=1}^{k} |p_{ij}|^2 \qquad (j = 1, \ldots, k).$$

Hence

$$|\det P|^2 = |\det U|^2\, |\det T|^2 = |\det T|^2 = \prod_{j=1}^{k} |t_{jj}|^2 \le \prod_{j=1}^{k} \Big( \sum_{i=1}^{k} |p_{ij}|^2 \Big).$$

In the next few lemmas, the positive integer $n$ and the positive real numbers $\mu_1, \ldots, \mu_n$ are those occurring in (1), and $\Gamma_p$ is defined by

$$\Gamma_p = \sup_{s > 0} s^{-p} \log(1 + s) \qquad (0 < p < 1). \tag{20}$$

Since $s^{-p} \log(1 + s)$ is continuous on the open interval $(0, \infty)$ and has limit zero as $s$ tends to either endpoint, $\Gamma_p$ is finite. Furthermore

$$1 + s \le \exp(\Gamma_p s^p) \qquad (s \ge 0;\ 0 < p < 1). \tag{21}$$
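Hadamard’s inequality (Lemma 3.2.5) is easy to test on small matrices. The sketch below is an illustration only: the matrices `P` and `Q` and the helper names `det` and `hadamard_bound` are assumptions introduced here. It compares $|\det P|^2$ with the product of the squared column norms, once for a generic complex matrix and once for a matrix with orthogonal columns, where equality is attained.

```python
def det(m):
    """Determinant by cofactor expansion along the first row (fine for small k)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        # Minor obtained by deleting the first row and column j.
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def hadamard_bound(m):
    """Product over columns j of the sum over rows i of |m[i][j]|^2."""
    bound = 1.0
    for j in range(len(m)):
        bound *= sum(abs(row[j]) ** 2 for row in m)
    return bound

P = [[1, 2j, 0], [3, 1, 1], [0, -1, 2]]   # generic complex matrix
Q = [[1, 1], [1, -1]]                     # orthogonal columns, so equality holds
print(abs(det(P)) ** 2, hadamard_bound(P))
print(abs(det(Q)) ** 2, hadamard_bound(Q))
```

For `P` the determinant is $3 - 12i$, so the left-hand side is 153 against a bound of 300; for `Q` both sides equal 4.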


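LEMMA 3.2.6. Suppose that $r$ is an integer such that $1 \le r \le n$, and $0 < p < 1$. Then

$$|d(\lambda)| \le \prod_{j=1}^{n} (1 + |\lambda| \mu_j), \tag{22}$$

$$|d(\lambda)| \le \exp\Big( |\lambda| \sum_{j=r+1}^{n} \mu_j \Big) \prod_{j=1}^{r} (1 + |\lambda| \mu_j), \tag{23}$$

$$|d(\lambda)| \le \exp\Big( |\lambda| \sum_{j=1}^{n} \mu_j \Big), \tag{24}$$

$$|d(\lambda)| \le \exp\Big( \Gamma_p |\lambda|^p \sum_{j=r+1}^{n} \mu_j^p \Big) \prod_{j=1}^{r} (1 + |\lambda| \mu_j), \tag{25}$$

$$|d(\lambda)| \le \exp\Big( \Gamma_p |\lambda|^p \sum_{j=1}^{n} \mu_j^p \Big). \tag{26}$$

Proof. Since $d(\lambda) = \det(I_k - \lambda A)$, Hadamard’s inequality gives

$$|d(\lambda)|^2 \le \prod_{j=1}^{k} \big( |\lambda a_{1j}|^2 + \cdots + |1 - \lambda a_{jj}|^2 + \cdots + |\lambda a_{kj}|^2 \big) \le \prod_{j=1}^{k} \Big( 1 + 2|\lambda|\, |a_{jj}| + |\lambda|^2 \sum_{i=1}^{k} |a_{ij}|^2 \Big).$$

It follows from (4), which gives $|a_{jj}| \le \mu_j$ when $j \le n$ and $a_{ij} = 0$ when $j > n$, that

$$|d(\lambda)|^2 \le \prod_{j=1}^{n} \big( 1 + 2|\lambda| \mu_j + |\lambda|^2 \mu_j^2 \big) = \prod_{j=1}^{n} (1 + |\lambda| \mu_j)^2.$$

This proves (22). Since $1 + s \le \exp(s)$ whenever $s \ge 0$, the factor $1 + |\lambda| \mu_j$ in the right-hand side of (22) can be replaced by $\exp(|\lambda| \mu_j)$; by making this change for appropriate values of $j$ we obtain (23) and (24). In the same way (25) and (26) can be deduced from (22) by use of the inequality (21).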


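LEMMA 3.2.7. Suppose that $r$ is an integer such that $1 \le r \le n$ and $0 < p < 1$. Then

$$\|d(\lambda) I + \lambda D_\lambda\| \le \exp(\tfrac12) \prod_{j=1}^{n} (1 + |\lambda| \mu_j), \tag{27}$$

$$\|d(\lambda) I + \lambda D_\lambda\| \le \exp\Big( \tfrac12 + |\lambda| \sum_{j=r+1}^{n} \mu_j \Big) \prod_{j=1}^{r} (1 + |\lambda| \mu_j), \tag{28}$$

$$\|d(\lambda) I + \lambda D_\lambda\| \le \exp\Big( \tfrac12 + |\lambda| \sum_{j=1}^{n} \mu_j \Big), \tag{29}$$

$$\|d(\lambda) I + \lambda D_\lambda\| \le \exp\Big( \tfrac12 + \Gamma_p |\lambda|^p \sum_{j=r+1}^{n} \mu_j^p \Big) \prod_{j=1}^{r} (1 + |\lambda| \mu_j), \tag{30}$$

$$\|d(\lambda) I + \lambda D_\lambda\| \le \exp\Big( \tfrac12 + \Gamma_p |\lambda|^p \sum_{j=1}^{n} \mu_j^p \Big). \tag{31}$$

Proof. It is sufficient to prove (27), since the other results follow from (27) in the same way that the last four inequalities in Lemma 3.2.6 were deduced from (22). By (9), $d(\lambda) I + \lambda D_\lambda$ leaves invariant both the subspaces $M$ and $M^\perp$; so, in order to prove (27), it suffices to establish the corresponding inequalities for the operators obtained by restricting $d(\lambda) I + \lambda D_\lambda$ to these subspaces. Since $D_\lambda y = 0$ when $y \in M^\perp$, the restriction to $M^\perp$ is $d(\lambda) I$, and the estimate (22) for its norm $|d(\lambda)|$ is in fact better than we require. It remains to consider the restriction $(d(\lambda) I + \lambda D_\lambda)^M$ to $M$.

The matrix of $(d(\lambda) I + \lambda D_\lambda)^M$ with respect to the basis $\varphi_1, \ldots, \varphi_k$ of $M$ is $d(\lambda) I_k + \lambda C(\lambda)$, and we have seen in proving Lemma 3.2.1 that this is $B(\lambda) = [b_{ij}(\lambda)]$, the transposed cofactor matrix of $I_k - \lambda A$. Thus

$$[d(\lambda) I + \lambda D_\lambda] \varphi_j = \sum_{i=1}^{k} b_{ij}(\lambda) \varphi_i,$$

$$\langle [d(\lambda) I + \lambda D_\lambda] \varphi_j, \varphi_i \rangle = b_{ij}(\lambda).$$

Given any unit vectors $x$ and $y$ in $M$, we may set

$$x = \alpha_1 \varphi_1 + \cdots + \alpha_k \varphi_k, \qquad y = \beta_1 \varphi_1 + \cdots + \beta_k \varphi_k,$$

where

$$|\alpha_1|^2 + \cdots + |\alpha_k|^2 = |\beta_1|^2 + \cdots + |\beta_k|^2 = 1.$$

Then

$$\langle [d(\lambda) I + \lambda D_\lambda] x, y \rangle = \sum_{i,j=1}^{k} b_{ij}(\lambda)\, \alpha_j \bar\beta_i = - \begin{vmatrix} 0 & \bar\beta_1 & \cdots & \bar\beta_k \\ \alpha_1 & & & \\ \vdots & & I_k - \lambda A & \\ \alpha_k & & & \end{vmatrix}$$

(this last determinant reduces to the summation in the preceding line when it is expanded by the first row and the resulting determinants are then expanded by the first column). By Hadamard’s inequality and equation (4)

$$\begin{aligned} |\langle [d(\lambda) I + \lambda D_\lambda] x, y \rangle|^2 &\le \Big( \sum_{i=1}^{k} |\alpha_i|^2 \Big) \prod_{j=1}^{k} \big( |\beta_j|^2 + |\lambda|^2 |a_{1j}|^2 + \cdots + |1 - \lambda a_{jj}|^2 + \cdots + |\lambda|^2 |a_{kj}|^2 \big) \\ &\le \prod_{j=1}^{k} \Big\{ 1 + 2|\lambda|\, |a_{jj}| + |\lambda|^2 \sum_{i=1}^{k} |a_{ij}|^2 + |\beta_j|^2 \Big\} \\ &\le \prod_{j=1}^{k} \Big\{ \Big( 1 + 2|\lambda|\, |a_{jj}| + |\lambda|^2 \sum_{i=1}^{k} |a_{ij}|^2 \Big) (1 + |\beta_j|^2) \Big\} \\ &\le \prod_{j=1}^{n} \big\{ 1 + 2|\lambda| \mu_j + |\lambda|^2 \mu_j^2 \big\} \prod_{j=1}^{k} \exp(|\beta_j|^2) \\ &= e \prod_{j=1}^{n} (1 + |\lambda| \mu_j)^2. \end{aligned}$$

Thus

$$|\langle [d(\lambda) I + \lambda D_\lambda] x, y \rangle| \le \exp(\tfrac12) \prod_{j=1}^{n} (1 + |\lambda| \mu_j)$$

whenever $x$ and $y$ are unit vectors in $M$. It follows that

$$\|(d(\lambda) I + \lambda D_\lambda)^M\| \le \exp(\tfrac12) \prod_{j=1}^{n} (1 + |\lambda| \mu_j).$$

This completes the proof of (27).

LEMMA 3.2.8. For $m = 1, 2, 3, \ldots$

$$|d_m| \le m^{-m} \exp(m) \Big( \sum_{j=1}^{n} \mu_j \Big)^m, \tag{32}$$

$$\|d_m I + D_{m-1}\| \le m^{-m} \exp(m + \tfrac12) \Big( \sum_{j=1}^{n} \mu_j \Big)^m. \tag{33}$$

Proof. We shall use the following result [60: p. 84]: if

$$f(\lambda) = \sum_{m=0}^{\infty} a_m \lambda^m$$

is an entire function, $r > 0$ and $M(r) = \sup\{|f(\lambda)| : |\lambda| = r\}$, then

$$|a_m| \le r^{-m} M(r) \qquad (m = 0, 1, 2, \ldots).$$

It follows from (24) that, for the entire function $d(\lambda)$,

$$M(r) \le \exp\Big( r \sum_{j=1}^{n} \mu_j \Big).$$

Thus

$$|d_m| \le r^{-m} \exp\Big( r \sum_{j=1}^{n} \mu_j \Big);$$

and by taking $r = m \big( \sum_{j=1}^{n} \mu_j \big)^{-1}$ we obtain (32).

Suppose next that $x$ and $y$ are unit vectors in $H$ and let

$$f(\lambda) = \langle [d(\lambda) I + \lambda D_\lambda] x, y \rangle = \langle x, y \rangle + \sum_{m=1}^{\infty} \lambda^m \langle [d_m I + D_{m-1}] x, y \rangle.$$

It follows from (29) that, for this function,

$$M(r) \le \exp\Big( \tfrac12 + r \sum_{j=1}^{n} \mu_j \Big).$$

Hence

$$|\langle [d_m I + D_{m-1}] x, y \rangle| \le r^{-m} \exp\Big( \tfrac12 + r \sum_{j=1}^{n} \mu_j \Big);$$

and since the last inequality has been proved for arbitrary unit vectors $x$ and $y$,

$$\|d_m I + D_{m-1}\| \le r^{-m} \exp\Big( \tfrac12 + r \sum_{j=1}^{n} \mu_j \Big).$$

This reduces to (33) when $r = m \big( \sum_{j=1}^{n} \mu_j \big)^{-1}$.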
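The coefficient bound (32) can be checked numerically in the simplest case. The sketch below assumes, purely for illustration, that $T$ is a positive diagonal operator of rank 3, so that $\mu_j$ are its diagonal entries, $d(\lambda) = \prod_j (1 - \lambda\mu_j)$ and $|d_m|$ is the $m$-th elementary symmetric function of the $\mu_j$. The values in `mu` and the helper name `elementary_symmetric` are assumptions, not taken from the text.

```python
import math

def elementary_symmetric(values):
    """Return [e_0, e_1, ..., e_n], the elementary symmetric functions of values."""
    e = [1.0]
    for v in values:
        # Multiply the polynomial sum_m e_m x^m by the factor (1 + v x).
        shifted = [0.0] + [v * c for c in e]
        e = [a + b for a, b in zip(e + [0.0], shifted)]
    return e

mu = [1.0, 0.5, 0.25]            # illustrative singular values (an assumption)
e = elementary_symmetric(mu)     # |d_m| = e[m] for the diagonal operator
total = sum(mu)                  # the sum of the mu_j appearing in (32)

for m in range(1, len(mu) + 1):
    bound = m ** (-m) * math.exp(m) * total ** m
    print(m, e[m], bound)
```

In each row the second number stays below the third, as (32) predicts; the bound is far from tight for small $m$, which is harmless since only its decay rate in $m$ matters for convergence of the series.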


3.3. Fredholm formulae for trace class operators 

In this section we define the Fredholm determinant and the first Fredholm minor of a trace class operator acting on a Hilbert space, and show that much of the theory developed in §3.2 remains valid in this more general setting. The main results are Theorems 3.3.9, 3.3.10 and 3.3.13.

We introduce some notation that will be used throughout §3.3. Let $T$ be a trace class operator acting on a Hilbert space $H$. It follows from Theorem 1.9.3 that there exist (finite or infinite) orthonormal sequences $(\varphi_j)$ and $(\psi_j)$ in $H$, and a sequence $(\mu_j)$ of positive real numbers (the eigenvalues of $(T^*T)^{1/2}$) such that

$$Tx = \sum_j \mu_j \langle x, \varphi_j \rangle \psi_j \qquad (x \in H). \tag{1}$$

By equation 2.3(4),

$$\sum_j \mu_j = \|T\|_1. \tag{2}$$

We shall usually assume that the index $j$ runs through the set of all positive integers; for if the summation in (1) is finite, then $T$ has finite rank and our results are already contained in those of §3.2. Since the trace class $C_1$ is an ideal in $B(H)$, $T^m \in C_1$ $(m = 1, 2, \ldots)$. We define

$$\sigma_m(T) = \tau(T^m) \qquad (m = 1, 2, \ldots), \tag{3}$$

$$d_m(T) = \frac{(-1)^m}{m!} \begin{vmatrix} \sigma_1 & 1 & 0 & \cdots & 0 \\ \sigma_2 & \sigma_1 & 2 & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ \sigma_{m-1} & \sigma_{m-2} & \sigma_{m-3} & \cdots & m-1 \\ \sigma_m & \sigma_{m-1} & \sigma_{m-2} & \cdots & \sigma_1 \end{vmatrix} \tag{4}$$

$(m = 1, 2, \ldots)$ and $d_0(T) = 1$,

$$D_m(T) = \frac{(-1)^m}{m!} \begin{vmatrix} \sigma_1 & 1 & 0 & \cdots & 0 & 0 \\ \sigma_2 & \sigma_1 & 2 & \cdots & 0 & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ \sigma_m & \sigma_{m-1} & \sigma_{m-2} & \cdots & \sigma_1 & m \\ T^{m+1} & T^m & T^{m-1} & \cdots & T^2 & T \end{vmatrix} \tag{5}$$

$(m = 1, 2, \ldots)$ and $D_0(T) = T$, where $\sigma_m = \sigma_m(T)$ in (4) and (5).

We shall consider $T$ as the limit of the operators $T_1, T_2, \ldots$, of finite rank, which are defined by

$$T_n x = \sum_{j=1}^{n} \mu_j \langle x, \varphi_j \rangle \psi_j \qquad (x \in H). \tag{6}$$

Corresponding to equation (2), we have

$$\sum_{j=1}^{n} \mu_j = \|T_n\|_1 \le \|T\|_1. \tag{7}$$

LEMMA 3.3.1. For $m = 1, 2, 3, \ldots$,

(i) $\|T^m - T_n^m\|_1 \to 0$ as $n \to \infty$,

(ii) $\sigma_m(T_n) \to \sigma_m(T)$ as $n \to \infty$,

(iii) $d_m(T_n) \to d_m(T)$ as $n \to \infty$,

(iv) $\|D_m(T_n) - D_m(T)\|_1 \to 0$ as $n \to \infty$.

Proof. In proving the final assertion of Theorem 2.3.8 (that the operators of finite rank are dense in $C_p$) we have already shown that

$$\|T - T_n\|_1 \to 0 \quad \text{as } n \to \infty,$$

whence $\|T_n\|_1 \to \|T\|_1$. Now

$$\|T^m - T_n^m\|_1 = \Big\| \sum_{r=0}^{m-1} T^{m-r-1} (T - T_n) T_n^r \Big\|_1 \le \sum_{r=0}^{m-1} \|T^{m-r-1} (T - T_n) T_n^r\|_1.$$

It follows from Corollary 2.3.11 and the inequality 2.3(5) that

$$\|T^m - T_n^m\|_1 \le \sum_{r=0}^{m-1} \|T\|_1^{m-r-1}\, \|T - T_n\|_1\, \|T_n\|_1^r \to 0 \quad \text{as } n \to \infty.$$

This proves (i), and (ii) follows immediately since the trace is bounded by the trace norm, so that

$$|\sigma_m(T) - \sigma_m(T_n)| = |\tau(T^m - T_n^m)| \le \|T^m - T_n^m\|_1.$$

The last two parts of the lemma are obvious consequences of (i), (ii) and the definitions (4) and (5) of $d_m(T)$ and $D_m(T)$.

LEMMA 3.3.2. For $m = 1, 2, 3, \ldots$,

(i) $|d_m(T)| \le m^{-m} \exp(m)\, \|T\|_1^m$;

(ii) $\|D_m(T)\|_1 \le m^{-m} \exp(m + \tfrac12)\, \|T\|_1^{m+1}$.

Proof. By Lemma 3.2.8 and the inequality (7)

$$|d_m(T_n)| \le m^{-m} \exp(m) \Big( \sum_{j=1}^{n} \mu_j \Big)^m \le m^{-m} \exp(m)\, \|T\|_1^m.$$

By letting $n \to \infty$ and using Lemma 3.3.1(iii), we obtain (i).

By equation 3.2(18), Lemma 3.2.8 and equation (2) we have

$$\begin{aligned} \|D_m(T_n)\|_1 &= \|T_n D_{m-1}(T_n) + d_m(T_n) T_n\|_1 \\ &\le \|T_n\|_1\, \|D_{m-1}(T_n) + d_m(T_n) I\| \\ &\le \|T_n\|_1\, m^{-m} \exp(m + \tfrac12) \Big( \sum_{j=1}^{n} \mu_j \Big)^m \\ &\le \|T\|_1\, m^{-m} \exp(m + \tfrac12)\, \|T\|_1^m; \end{aligned}$$

and (ii) follows, from Lemma 3.3.1(i) and (iv), by letting $n \to \infty$.

We deduce from Lemma 3.3.2 that the series

$$\sum_{m=0}^{\infty} \lambda^m d_m(T), \qquad \sum_{m=0}^{\infty} |\lambda|^m\, \|D_m(T)\|_1$$

converge for all complex $\lambda$. The entire function

$$d(\lambda, T) = \sum_{m=0}^{\infty} \lambda^m d_m(T) \tag{8}$$

will be called the Fredholm determinant of $T$. Since $C_1$ is complete the series $\sum \lambda^m D_m(T)$ converges with respect to $\|\cdot\|_1$ for all complex $\lambda$. The trace class operator $D_{\lambda,T}$ defined by

$$D_{\lambda,T} = \sum_{m=0}^{\infty} \lambda^m D_m(T) \tag{9}$$

will be called the first Fredholm minor of $T$.

LEMMA 3.3.3. (i) $d(\lambda, T_n) \to d(\lambda, T)$ as $n \to \infty$,

(ii) $\|D_{\lambda,T} - D_{\lambda,T_n}\|_1 \to 0$ as $n \to \infty$,

(iii) $\|D_{\lambda,T} - D_{\lambda,T_n}\| \to 0$ as $n \to \infty$.

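Proof. We shall prove only part (ii) of this lemma, since (i) follows from a very similar argument and (iii) is an immediate consequence of (ii).

For each positive integer $q$,

$$\begin{aligned} \|D_{\lambda,T} - D_{\lambda,T_n}\|_1 &= \Big\| \sum_{m=0}^{\infty} \lambda^m [D_m(T) - D_m(T_n)] \Big\|_1 \\ &\le \sum_{m=0}^{q} |\lambda|^m\, \|D_m(T) - D_m(T_n)\|_1 + \sum_{m=q+1}^{\infty} |\lambda|^m \{ \|D_m(T)\|_1 + \|D_m(T_n)\|_1 \}. \end{aligned}$$

By Lemma 3.3.2(ii), applied to both $T$ and $T_n$, together with the inequality (7), we have

$$\|D_{\lambda,T} - D_{\lambda,T_n}\|_1 \le \sum_{m=0}^{q} |\lambda|^m\, \|D_m(T) - D_m(T_n)\|_1 + 2 \sum_{m=q+1}^{\infty} |\lambda|^m m^{-m} \exp(m + \tfrac12)\, \|T\|_1^{m+1}. \tag{10}$$

Given a positive $\varepsilon$, we can choose $q$ so that

$$2 \sum_{m=q+1}^{\infty} |\lambda|^m m^{-m} \exp(m + \tfrac12)\, \|T\|_1^{m+1} < \tfrac12 \varepsilon.$$

By Lemma 3.3.1(iv) there is a positive integer $n_0$ such that

$$\sum_{m=0}^{q} |\lambda|^m\, \|D_m(T) - D_m(T_n)\|_1 < \tfrac12 \varepsilon$$

whenever $n \ge n_0$. It follows from (10) that

$$\|D_{\lambda,T} - D_{\lambda,T_n}\|_1 < \varepsilon$$

whenever $n \ge n_0$.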


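LEMMA 3.3.4. For all complex $\lambda$,

$$[d(\lambda, T) I + \lambda D_{\lambda,T}](I - \lambda T) = (I - \lambda T)[d(\lambda, T) I + \lambda D_{\lambda,T}] = d(\lambda, T) I. \tag{11}$$

Proof. By virtue of Lemmas 3.3.3(i), (iii) and 3.3.1(i), this result follows at once from the corresponding relations for $T_n$ (see Lemma 3.2.1).

COROLLARY 3.3.5. If $d(\lambda, T) \ne 0$ then $I - \lambda T$ has an inverse given by

$$(I - \lambda T)^{-1} = I + \lambda [d(\lambda, T)]^{-1} D_{\lambda,T}. \tag{12}$$

Our next aim is to show that, if $d(\lambda_0, T) = 0$ and $\lambda_0$ has multiplicity $k$ as a zero of $d(\lambda, T)$, then $\lambda_0^{-1}$ is an eigenvalue of $T$ and has algebraic multiplicity $k$. Before proving this we require two auxiliary results. In the following lemma, $d'(\lambda, T)$ denotes the derivative, with respect to $\lambda$, of $d(\lambda, T)$.

LEMMA 3.3.6. $\tau(D_{\lambda,T}) = -d'(\lambda, T)$.

Proof. It follows from (5), (4) and the comments following those equations that

$$\tau(D_0(T)) = \tau(T) = \sigma_1 = -d_1(T),$$

$$\tau(D_m(T)) = \frac{(-1)^m}{m!} \begin{vmatrix} \sigma_1 & 1 & 0 & \cdots & 0 & 0 \\ \sigma_2 & \sigma_1 & 2 & \cdots & 0 & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ \sigma_m & \sigma_{m-1} & \sigma_{m-2} & \cdots & \sigma_1 & m \\ \sigma_{m+1} & \sigma_m & \sigma_{m-1} & \cdots & \sigma_2 & \sigma_1 \end{vmatrix} = -(m+1)\, d_{m+1}(T) \qquad (m \ge 1).$$

Since the series (9) for $D_{\lambda,T}$ converges with respect to $\|\cdot\|_1$ and $\tau$ is a continuous linear functional on $C_1$, we have

$$\tau(D_{\lambda,T}) = \sum_{m=0}^{\infty} \lambda^m \tau(D_m(T)) = -\sum_{m=0}^{\infty} (m+1) \lambda^m d_{m+1}(T) = -d'(\lambda, T).$$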

LEMMA 3.3.7. Let $K$ be a trace class operator acting on a Hilbert space $H$ and let $\mathcal{D}$ be the open set consisting of all non-zero complex numbers $\lambda$ for which $\lambda^{-1}$ is not an eigenvalue of $K$. For each $\lambda$ in $\mathcal{D}$ define an operator $K_\lambda$ on $H$ by

$$(I - \lambda K)^{-1} = I + \lambda K_\lambda.$$

Then $K_\lambda$ is a trace class operator; and if $\lambda_0 \in \mathcal{D}$ then $\tau(K_\lambda)$ is bounded on some neighbourhood of $\lambda_0$.

Proof. Since $(I - \lambda K)(I + \lambda K_\lambda) = I$ we have

$$K_\lambda = K(I + \lambda K_\lambda);$$

so $K_\lambda$ is in the trace class and

$$|\tau(K_\lambda)| \le \|K(I + \lambda K_\lambda)\|_1 \le \|K\|_1\, \|I + \lambda K_\lambda\|. \tag{13}$$

By Theorem 1.5.1, the operator-valued function $\lambda \to I + \lambda K_\lambda = (I - \lambda K)^{-1}$ is analytic on $\mathcal{D}$, and is therefore continuous with respect to the norm topology on $B(H)$. It follows that, if $\lambda_0 \in \mathcal{D}$ then $\|I + \lambda K_\lambda\|$ is bounded on some neighbourhood of $\lambda_0$; and, by (13), the same is true of $\tau(K_\lambda)$.

LEMMA 3.3.8. Let $\lambda_0$ be a complex number such that $d(\lambda_0, T) = 0$. Then $\lambda_0^{-1}$ is an eigenvalue of $T$ and its algebraic multiplicity as such is equal to the multiplicity of $\lambda_0$ as a zero of $d(\lambda, T)$.

Proof. Suppose that $\lambda_0$ has multiplicity $k$ as a zero of $d(\lambda, T)$. Then

$$d(\lambda, T) = (\lambda - \lambda_0)^k g(\lambda), \tag{14}$$

where $g$ is an entire function and $g(\lambda_0) \ne 0$. We may choose a positive $\delta$ such that $g(\lambda) \ne 0$ when $|\lambda - \lambda_0| < \delta$. If $0 < |\lambda - \lambda_0| < \delta$ then $d(\lambda, T) \ne 0$ and, by Corollary 3.3.5,

$$(I - \lambda T)^{-1} = I + \lambda S_\lambda \qquad (0 < |\lambda - \lambda_0| < \delta), \tag{15}$$

where

$$S_\lambda = [d(\lambda, T)]^{-1} D_{\lambda,T} \qquad (0 < |\lambda - \lambda_0| < \delta). \tag{16}$$

By Lemma 3.3.6, $\tau(S_\lambda) = -d'(\lambda, T)/d(\lambda, T)$; and from (14) we have

$$\tau(S_\lambda) = -\frac{k}{\lambda - \lambda_0} - h(\lambda) \qquad (0 < |\lambda - \lambda_0| < \delta), \tag{17}$$

where $h(\lambda) = g'(\lambda)/g(\lambda)$ and thus $h$ is bounded near $\lambda_0$.

Since $\tau(S_\lambda)$ is unbounded on each neighbourhood of $\lambda_0$ and $\lambda_0 \ne 0$ (because $d(0, T) = 1$), it follows from Lemma 3.3.7 that $\lambda_0^{-1}$ is an eigenvalue of $T$. Suppose that, as such, it has algebraic multiplicity $m$. By Theorem 1.8.1 there exist closed subspaces $N$ and $R$ of $H$, both invariant under $T$, such that

$$N + R = H, \qquad N \cap R = (0), \tag{18}$$

$N$ is $m$-dimensional; and, if $T^N$ and $T^R$ denote the restrictions of $T$ to $N$ and $R$, respectively, then $I - \lambda_0 T^N$ is nilpotent and $I - \lambda_0 T^R$ is invertible. Since an orthonormal system in $R$ is also an orthonormal system in $H$, it follows at once from Definition 2.1.1 that $T^R$ is in the trace class of operators on $R$. We shall use the decomposition of $T$ relative to $N$ and $R$ to obtain a second expression for $\tau(S_\lambda)$ which, when compared with (17), yields the required equation $k = m$.

By Theorem 1.8.3(iii), any closed subspace of $H$ which is invariant under $T$ is invariant also under $(I - \lambda T)^{-1}$ for all complex numbers $\lambda$ such that this inverse operator exists. In particular, $N$ and $R$ are invariant under $(I - \lambda T)^{-1}$. It follows that

$$(I - \lambda T^R)^{-1} = I + \lambda S_\lambda^R \qquad (0 < |\lambda - \lambda_0| < \delta),$$

where $S_\lambda^R$ is the restriction to $R$ of $S_\lambda$. By Lemma 3.3.7 (applied to $T^R$), $S_\lambda^R$ is a trace class operator and its trace is bounded near $\lambda_0$. We suppose that

$$|\tau(S_\lambda^R)| \le M \qquad (0 < |\lambda - \lambda_0| < \delta). \tag{19}$$


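Let $\varphi_1, \ldots, \varphi_m$ be an orthonormal basis of $N$, with respect to which $T^N$ has a superdiagonal $m \times m$ matrix $A = [a_{ij}]$. Since $I - \lambda_0 T^N$ is nilpotent, so is its (superdiagonal) matrix $I_m - \lambda_0 A$. Hence the diagonal coefficients $1 - \lambda_0 a_{jj}$ in this matrix, which are also its eigenvalues, are all zero. Thus $a_{jj} = \lambda_0^{-1}$ $(j = 1, \ldots, m)$, and

$$(I - \lambda_0 T)\varphi_j = -\lambda_0 \sum_{i=1}^{j-1} a_{ij} \varphi_i \qquad (j = 1, \ldots, m).$$

It follows that, if $N_j$ is the subspace spanned by $\varphi_1, \ldots, \varphi_j$ $(j = 1, \ldots, m)$ and $N_0 = (0)$, then

$$(I - \lambda_0 T)(N_j) \subseteq N_{j-1} \qquad (j = 1, \ldots, m).$$

Since $R$ is invariant under $T$,

$$(I - \lambda_0 T)(N_j + R) \subseteq N_{j-1} + R \qquad (j = 1, \ldots, m). \tag{20}$$

The subspace $N_j + R$ is closed, by Theorem 1.1.1; it is spanned by $\varphi_j$ and $N_{j-1} + R$, and $\varphi_j \notin N_{j-1} + R$ (otherwise,

$$\varphi_j = \alpha_1 \varphi_1 + \cdots + \alpha_{j-1} \varphi_{j-1} + x$$

for suitable scalars $\alpha_1, \ldots, \alpha_{j-1}$ and some $x$ in $R$, and $x$ is a non-zero vector in $N \cap R$, contradicting (18)). Hence we may choose a unit vector $\psi_j$ in $N_j + R$ which is orthogonal to $N_{j-1} + R$; $N_j + R$ is spanned by $\psi_j$ and $N_{j-1} + R$, so $H$ $(= N + R = N_m + R)$ is spanned by $\psi_1, \ldots, \psi_m$ and $R$. Thus, if $\{\theta_j : j \in J\}$ is an orthonormal basis of $R$, then $\{\psi_1, \ldots, \psi_m\} \cup \{\theta_j : j \in J\}$ is an orthonormal basis of $H$.

It follows that

$$\tau(S_\lambda) = \sum_{j=1}^{m} \langle S_\lambda \psi_j, \psi_j \rangle + \sum_{j \in J} \langle S_\lambda \theta_j, \theta_j \rangle;$$

that is,

$$\tau(S_\lambda) = \sum_{j=1}^{m} \langle S_\lambda \psi_j, \psi_j \rangle + \tau(S_\lambda^R) \qquad (0 < |\lambda - \lambda_0| < \delta). \tag{21}$$

The next step is to compute $\langle S_\lambda \psi_j, \psi_j \rangle$. It follows from (20) that $N_j + R$ is invariant under $T$; as noted above, this implies that it is invariant under $(I - \lambda T)^{-1}$ for all complex numbers $\lambda$ such that this inverse operator exists, and so invariant under $S_\lambda$ when $0 < |\lambda - \lambda_0| < \delta$. In particular, $S_\lambda \psi_j \in N_j + R$, so we may choose a complex number $\alpha$ and $x$ in $N_{j-1} + R$ satisfying

$$S_\lambda \psi_j = \alpha \psi_j + x. \tag{22}$$

Then

$$\begin{aligned} \psi_j &= (I - \lambda T)(I + \lambda S_\lambda)\psi_j \\ &= (I - \lambda T)[(1 + \lambda\alpha)\psi_j + \lambda x] \\ &= (1 + \lambda\alpha) \Big[ \Big( 1 - \frac{\lambda}{\lambda_0} \Big) \psi_j + \frac{\lambda}{\lambda_0} (I - \lambda_0 T)\psi_j \Big] + \lambda (I - \lambda T) x \\ &= (1 + \lambda\alpha) \Big( 1 - \frac{\lambda}{\lambda_0} \Big) \psi_j + y, \end{aligned}$$

where

$$y = (1 + \lambda\alpha) \frac{\lambda}{\lambda_0} (I - \lambda_0 T)\psi_j + \lambda (I - \lambda T) x.$$

Since $x \in N_{j-1} + R$ and this subspace is invariant under $T$, it follows from (20) that $y \in N_{j-1} + R$. However, $\psi_j \notin N_{j-1} + R$, so $y = 0$,

$$(1 + \lambda\alpha) \Big( 1 - \frac{\lambda}{\lambda_0} \Big) = 1,$$

and thus $\alpha = (\lambda_0 - \lambda)^{-1}$. Since $x \in N_{j-1} + R$ and $\psi_j$ is orthogonal to $N_{j-1} + R$, we can now deduce from (22) that

$$\langle S_\lambda \psi_j, \psi_j \rangle = \alpha = -\frac{1}{\lambda - \lambda_0}.$$

This, with (21), yields

$$\tau(S_\lambda) = -\frac{m}{\lambda - \lambda_0} + \tau(S_\lambda^R) \qquad (0 < |\lambda - \lambda_0| < \delta). \tag{23}$$

Comparison of (17) and (23), noting that $h(\lambda)$ and $\tau(S_\lambda^R)$ are both bounded near $\lambda_0$, shows that $m = k$.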


We now summarise, in the form of a theorem, the main results so far obtained in §3.3.


THEOREM 3.3.9. Let $T$ be a trace class operator acting on a Hilbert space $\mathcal{H}$, and suppose that the Fredholm determinant $d(\lambda, T)$ and the first Fredholm minor $D_{\lambda,T}$ are defined by means of equations (3), (4), (5), (8) and (9). Then the series for $d(\lambda, T)$ and $D_{\lambda,T}$ converge for all complex $\lambda$ (the latter with respect to the norm $\|\ \|_1$ on the trace class $\mathcal{C}_1$). Furthermore

(i) if $\lambda$ is a complex number such that $d(\lambda, T) \neq 0$, then $I-\lambda T$ has an inverse given by

    $(I-\lambda T)^{-1} = I + \lambda\, d(\lambda, T)^{-1} D_{\lambda,T}$;

(ii) if $\lambda$ is a complex number such that $d(\lambda, T) = 0$, then $\lambda^{-1}$ is an eigenvalue of $T$ and its algebraic multiplicity as such is equal to the multiplicity of $\lambda$ as a zero of $d(\lambda, T)$.

We now obtain estimates for the rates of growth of $d(\lambda, T)$ and $D_{\lambda,T}$ as $|\lambda| \to \infty$. In the following theorem $\Gamma_p$ is the constant introduced in 3.2(20).


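THEOREM 3.3.10. (i) Suppose that $T$ is a trace class operator acting on a Hilbert space $\mathcal{H}$, $d(\lambda, T)$ is the Fredholm determinant of $T$, $D_{\lambda,T}$ is the first Fredholm minor and $r$ is a positive integer. Let $(\mu_j)$ be the infinite sequence consisting of the non-zero eigenvalues of $(T^*T)^{1/2}$, counted according to their algebraic multiplicities and followed by zeros if the number of such eigenvalues is finite. Then

(24)  $|d(\lambda, T)| \le \exp\bigl(|\lambda| \sum_{j=r+1}^{\infty} \mu_j\bigr) \prod_{j=1}^{r} (1+|\lambda|\mu_j)$;

(25)  $\|d(\lambda, T)I + \lambda D_{\lambda,T}\| \le \exp\bigl(\tfrac12 + |\lambda| \sum_{j=r+1}^{\infty} \mu_j\bigr) \prod_{j=1}^{r} (1+|\lambda|\mu_j)$;

(26)  $|d(\lambda, T)| \le \exp(|\lambda|\,\|T\|_1)$;

(27)  $\|d(\lambda, T)I + \lambda D_{\lambda,T}\| \le \exp(\tfrac12 + |\lambda|\,\|T\|_1)$.

(ii) Suppose, further, that $0 < p < 1$ and $\sum \mu_j^p$ converges. Then

(28)  $|d(\lambda, T)| \le \exp\bigl(\Gamma_p |\lambda|^p \sum_{j=r+1}^{\infty} \mu_j^p\bigr) \prod_{j=1}^{r} (1+|\lambda|\mu_j)$;

(29)  $\|d(\lambda, T)I + \lambda D_{\lambda,T}\| \le \exp\bigl(\tfrac12 + \Gamma_p |\lambda|^p \sum_{j=r+1}^{\infty} \mu_j^p\bigr) \prod_{j=1}^{r} (1+|\lambda|\mu_j)$;

(30)  $|d(\lambda, T)| \le \exp\bigl(\Gamma_p |\lambda|^p \sum_{j=1}^{\infty} \mu_j^p\bigr)$;

(31)  $\|d(\lambda, T)I + \lambda D_{\lambda,T}\| \le \exp\bigl(\tfrac12 + \Gamma_p |\lambda|^p \sum_{j=1}^{\infty} \mu_j^p\bigr)$.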


Proof. Suppose $n$ is an integer such that $n \ge r$. By 3.2(23) and 3.2(28) we have

    $|d(\lambda, T_n)| \le \exp\bigl(|\lambda| \sum_{j=r+1}^{\infty} \mu_j\bigr) \prod_{j=1}^{r} (1+|\lambda|\mu_j)$,

    $\|d(\lambda, T_n)I + \lambda D_{\lambda,T_n}\| \le \exp\bigl(\tfrac12 + |\lambda| \sum_{j=r+1}^{\infty} \mu_j\bigr) \prod_{j=1}^{r} (1+|\lambda|\mu_j)$.

By taking limits as $n \to \infty$ and appealing to Lemma 3.3.3(i) and (iii), we obtain (24) and (25).

The proofs of (26), ..., (31) are similar; in each case we quote the corresponding result for $T_n$, which has already been proved as part of Lemma 3.2.6 or Lemma 3.2.7. For (26) and (27) we also make use of (7).
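
The estimates (24) and (26) can be checked numerically in the simplest setting. The following is only a sketch under a stated assumption: $T$ is replaced by a small square matrix, for which the Fredholm determinant is the ordinary determinant $\det(I-\lambda T)$ and the $\mu_j$ are the singular values. The function name `determinant_bounds` and the test matrix are illustrative and do not come from the text.

```python
import numpy as np

def determinant_bounds(T, lam, r):
    """Return |det(I - lam*T)| together with the right-hand sides of (24) and (26).

    Assumption: T is a square matrix, so the mu_j are its singular values.
    """
    n = T.shape[0]
    mu = np.linalg.svd(T, compute_uv=False)          # mu_1 >= mu_2 >= ...
    d_abs = abs(np.linalg.det(np.eye(n) - lam * T))
    # (24): exponential factor for the tail j > r, product for j <= r
    bound24 = np.exp(abs(lam) * mu[r:].sum()) * np.prod(1 + abs(lam) * mu[:r])
    # (26): exp(|lam| * trace norm of T)
    bound26 = np.exp(abs(lam) * mu.sum())
    return d_abs, bound24, bound26

rng = np.random.default_rng(0)
T = rng.standard_normal((5, 5)) + 1j * rng.standard_normal((5, 5))
d_abs, b24, b26 = determinant_bounds(T, lam=0.7 - 0.2j, r=2)
print(d_abs <= b24 <= b26)
```

Since $1+x \le e^x$, the bound (24) is never worse than (26); taking $r$ large trades the exponential factor for a polynomial one, which is what Corollary 3.3.11 exploits.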


COROLLARY 3.3.11. Suppose that $\epsilon > 0$ and that the hypotheses of Theorem 3.3.10(i) are satisfied. Then there is a constant $M$ such that, for all complex $\lambda$,

(32)  $|d(\lambda, T)| \le M \exp(\epsilon|\lambda|)$,

(33)  $\|d(\lambda, T)I + \lambda D_{\lambda,T}\| \le M \exp(\epsilon|\lambda|)$.

Suppose, further, that $0 < p < 1$ and $\sum \mu_j^p$ converges. Then there is a constant $N$ such that, for all complex $\lambda$,

(34)  $|d(\lambda, T)| \le N \exp(\epsilon|\lambda|^p)$,

(35)  $\|d(\lambda, T)I + \lambda D_{\lambda,T}\| \le N \exp(\epsilon|\lambda|^p)$.
Proof. Since $\sum_j \mu_j = \|T\|_1 < \infty$, we can choose an integer $r$ such that

(36)  $\sum_{j=r+1}^{\infty} \mu_j < \tfrac12\epsilon$.

If $M$ is defined to be the supremum, for $t \ge 0$, of the function

    $\exp(\tfrac12 - \tfrac12\epsilon t) \prod_{j=1}^{r} (1+t\mu_j)$,

we have

(37)  $\exp(\tfrac12) \prod_{j=1}^{r} (1+t\mu_j) \le M \exp(\tfrac12\epsilon t)$  $(t \ge 0)$.

From (36) and (37) it follows that, for all complex $\lambda$,

    $\exp\bigl(\tfrac12 + |\lambda| \sum_{j=r+1}^{\infty} \mu_j\bigr) \prod_{j=1}^{r} (1+|\lambda|\mu_j) \le M \exp(\epsilon|\lambda|)$.

With the aid of this last inequality, (32) and (33) are immediate consequences of (24) and (25), respectively.

If $\sum \mu_j^p < \infty$ we can choose an integer $r$ such that

    $\Gamma_p \sum_{j=r+1}^{\infty} \mu_j^p < \tfrac12\epsilon$,

and define $N$ to be the supremum, for $t \ge 0$, of the function

    $\exp(\tfrac12 - \tfrac12\epsilon t^p) \prod_{j=1}^{r} (1+t\mu_j)$.

An argument similar to the proof of (32) and (33) then shows that (34) and (35) are immediate consequences of (28) and (29), respectively.


Note that the inequality (32) implies that $d(\lambda, T)$ is an entire function of finite order at most 1, and that if its order is exactly 1 then it is of minimum type (for the terminology used here, see §1.4). When $\sum \mu_j^p$ converges, (34) shows that $d(\lambda, T)$ has finite order at most $p$, and is of minimum type if its order is exactly $p$. Similar comments apply to the operator-valued entire function $D_{\lambda,T}$.


LEMMA 3.3.12. Let $T$ be a quasi-nilpotent trace class operator acting on a Hilbert space $\mathcal{H}$. Then $d(\lambda, T) = 1$ for all complex $\lambda$, and $\tau(T) = 0$.

Proof. Since $T$ is quasi-nilpotent, it follows from Theorem 3.3.9 that the entire function $d(\lambda, T)$ has no zeros; and, by (32), it has finite order at most 1. The Hadamard factorization theorem (Theorem 1.4.2) shows that $d(\lambda, T)$ has the form $\exp(\alpha+\beta\lambda)$, where $\alpha$ and $\beta$ are constants. Since

    $\exp \alpha = d(0, T) = 1$,

$d(\lambda, T) = \exp(\beta\lambda)$. Given any positive $\epsilon$, it follows from (32) that $|\beta| \le \epsilon$, so $\beta = 0$. Thus, for all complex $\lambda$,

    $1 = d(\lambda, T) = \sum_{n=0}^{\infty} d_n(T)\lambda^n$,

whence, by comparing coefficients and using (4), we have

    $\tau(T) = -d_1(T) = 0$.


It follows from Corollary 2.3.6, with $p = 1$, that the eigenvalues of a trace class operator $T$ form an absolutely convergent series. We now show that the sum of this series is $\tau(T)$.

THEOREM 3.3.13. Suppose that $T$ is a trace class operator acting on a Hilbert space $\mathcal{H}$, and $(\lambda_j)$ is the sequence of non-zero eigenvalues of $T$, counted according to their algebraic multiplicities. Then

    $\tau(T) = \sum_j \lambda_j$,

and the Fredholm determinant $d(\lambda, T)$ is given by

    $d(\lambda, T) = \prod_j (1-\lambda\lambda_j)$.


Proof. Let $\mathcal{M}$ be the closed subspace of $\mathcal{H}$ which is generated by the principal vectors of $T^*$, associated with non-zero eigenvalues, and let $\mathcal{N} = \mathcal{M}^\perp$. By Theorem 1.8.5, $\mathcal{N}$ is invariant under $T$ and the restriction $T|\mathcal{N}$ of $T$ to $\mathcal{N}$ is quasi-nilpotent. Since any orthonormal system in $\mathcal{N}$ is also an orthonormal system in $\mathcal{H}$, it follows at once from Definition 2.1.1 that $T|\mathcal{N}$ is a trace class operator on $\mathcal{N}$. By Lemma 3.3.12, $\tau(T|\mathcal{N}) = 0$.

The sequence of non-zero eigenvalues of $T^*$, with the correct multiplicities, is $(\bar\lambda_j)$. By Theorem 1.8.6 there is an orthonormal basis $(\phi_j)$ of $\mathcal{M}$ such that $\langle T^*\phi_j, \phi_j\rangle = \bar\lambda_j$, whence $\langle T\phi_j, \phi_j\rangle = \lambda_j$. Let $(\psi_k)$ be any orthonormal basis of $\mathcal{N}$. Then $\{\phi_j\} \cup \{\psi_k\}$ is an orthonormal basis of $\mathcal{H}$, and

    $\tau(T) = \sum_j \langle T\phi_j, \phi_j\rangle + \sum_k \langle T\psi_k, \psi_k\rangle = \sum_j \lambda_j + \tau(T|\mathcal{N}) = \sum_j \lambda_j$.

By Theorem 3.3.9, $(\lambda_j^{-1})$ is the sequence of zeros of the entire function $d(\lambda, T)$, with the correct multiplicities. Since $d(\lambda, T)$ has finite order not greater than 1, and $\sum |\lambda_j| < \infty$ (Corollary 2.3.6), the Hadamard factorization theorem (Theorem 1.4.2) shows that, for suitable constants $\alpha$ and $\beta$,

(38)  $d(\lambda, T) = \exp(\alpha+\beta\lambda) P(\lambda)$,

where

    $P(\lambda) = \prod_j (1-\lambda\lambda_j)$.

Since $\sum |\lambda_j| < \infty$, the partial products

    $P_n(\lambda) = \prod_{j=1}^{n} (1-\lambda\lambda_j)$

converge to $P(\lambda)$, uniformly in any bounded subset of the complex plane [12: §5.32, p.104]. It follows [12: §5.13, p.95] that

    $P'(0) = \lim_{n\to\infty} P_n'(0) = \lim_{n\to\infty} \bigl[-\sum_{j=1}^{n} \lambda_j\bigr] = -\sum_j \lambda_j = -\tau(T)$.

When $\lambda = 0$, (38) gives $1 = \exp \alpha$, so

(39)  $d(\lambda, T) = \exp(\beta\lambda) P(\lambda)$.

Thus

    $d'(\lambda, T) = \beta\, d(\lambda, T) + \exp(\beta\lambda) P'(\lambda)$,

and

    $\tau(T) = -d_1(T) = -d'(0, T) = -\beta - P'(0) = -\beta + \tau(T)$.

Hence $\beta = 0$ and, by (39),

    $d(\lambda, T) = P(\lambda) = \prod_j (1-\lambda\lambda_j)$.
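
For a matrix, both conclusions of Theorem 3.3.13 reduce to familiar facts about the trace and the characteristic polynomial. The sketch below checks them numerically; it assumes a finite-dimensional stand-in in which $d(\lambda, T)$ is computed as $\det(I-\lambda T)$, and the helper name `fredholm_det` is illustrative only.

```python
import numpy as np

def fredholm_det(T, lam):
    """Finite-dimensional stand-in for d(lam, T): the determinant of I - lam*T."""
    return np.linalg.det(np.eye(T.shape[0]) - lam * T)

rng = np.random.default_rng(2)
T = rng.standard_normal((4, 4))
eigs = np.linalg.eigvals(T)      # eigenvalues, repeated by algebraic multiplicity
lam = 0.3 + 0.4j

# tau(T) = sum of eigenvalues
trace_gap = abs(np.trace(T) - eigs.sum())
# d(lam, T) = product of (1 - lam * lambda_j)
product_gap = abs(fredholm_det(T, lam) - np.prod(1 - lam * eigs))
print(trace_gap < 1e-10, product_gap < 1e-10)
```

The substance of the theorem is that the same identities survive in infinite dimensions, where neither side is defined by a finite computation.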


3.4. The resolvent of a quasi-nilpotent operator 
Suppose that $T$ is a quasi-nilpotent operator acting on a Hilbert space $\mathcal{H}$. Then $\Phi$, defined by

    $\Phi(\lambda) = (I-\lambda T)^{-1}$,

is an operator-valued entire function. Our main purpose in this section, achieved in Theorem 3.4.6, is to estimate the rate of growth of $\|\Phi(\lambda)\|$ as $|\lambda| \to \infty$, given that $T$ is in the von Neumann-Schatten class $\mathcal{C}_p$. When $\mathcal{H}$ is finite-dimensional the problem is a trivial one; accordingly, to avoid exceptional cases, we assume throughout §3.4 that $\mathcal{H}$ is infinite-dimensional.

Up until now we have considered the space $\mathcal{C}_p$ only for $p \ge 1$, but the work in this section requires the introduction of a class $\mathcal{C}_p$ for each positive $p$. Suppose that $T$ is a compact linear operator acting on $\mathcal{H}$. Then $T$ has a polar decomposition $T = V_T H_T$, where $H_T$ $(= (T^*T)^{1/2})$ is a positive compact operator. We shall denote by $(\mu_j(T))$ the sequence of non-zero eigenvalues of $H_T$, arranged in decreasing order and counted according to their multiplicities. When $T$ has finite rank $n$, the same is true of $H_T$, and so $H_T$ has only $n$ non-zero eigenvalues; in this case, it is convenient to define $\mu_j(T)$ to be 0 when $j > n$. With this convention, if $T$ is any compact linear operator on $\mathcal{H}$, then $(\mu_j(T))$ is a decreasing real sequence with limit 0; and Theorem 1.9.3 shows that there exist orthonormal sequences $(\phi_j)$, $(\psi_j)$ such that

(1)  $Tx = \sum_j \mu_j \langle x, \phi_j\rangle \psi_j$  $(x \in \mathcal{H})$,

with $\mu_j = \mu_j(T)$.


DEFINITION 3.4.1. Suppose $p > 0$ and $\mathcal{H}$ is an infinite-dimensional Hilbert space. Then $\mathcal{C}_p$ is the set of all compact linear operators $T$ on $\mathcal{H}$ such that

    $\sum_j \mu_j(T)^p < \infty$.

For $T$ in $\mathcal{C}_p$,

    $\|T\|_p = \bigl[\sum_j \mu_j(T)^p\bigr]^{1/p}$.

From the final statement of Theorem 2.1.6, the remarks preceding Definition 2.3.2, and equation 2.3(4), it follows that the above definition is consistent with the terminology used, when $p \ge 1$, in Chapter 2.

Clearly, $\mathcal{C}_p \subseteq \mathcal{C}_q$ when $q > p > 0$. It is not difficult to show that $\mathcal{C}_p$ is a two-sided ideal in $\mathcal{B}(\mathcal{H})$, and contains the adjoint of each of its members; however, $\|\ \|_p$ is not a norm on $\mathcal{C}_p$ when $0 < p < 1$. We omit the proofs, since we shall not make use of these results.
The only additional property of $\mathcal{C}_p$ required below is a weakened form of Hölder's inequality. This is given in Lemma 3.4.4, after some auxiliary results.

LEMMA 3.4.2. Suppose that $T$ is a compact linear operator acting on a Hilbert space $\mathcal{H}$, and $n$ is a positive integer. Then

(2)  $\|T-F\| \ge \mu_n(T)$

for each operator $F$ on $\mathcal{H}$ with finite rank less than $n$. Equality occurs in (2) for at least one such $F$.

Proof. We may suppose that $T$ is given by equation (1) and that, for suitable $y_j$ and $z_j$ in $\mathcal{H}$ $(1 \le j < n)$,

    $Fx = \sum_{j=1}^{n-1} \langle x, y_j\rangle z_j$  $(x \in \mathcal{H})$.

There exist complex numbers $a_1, \ldots, a_n$ such that

    $\sum_{i=1}^{n} |a_i|^2 = 1$,  $\sum_{i=1}^{n} a_i \langle \phi_i, y_j\rangle = 0$  $(1 \le j < n)$.

If $x$ is the unit vector $\sum a_i\phi_i$, then $\langle x, y_j\rangle = 0$ $(1 \le j < n)$ and so $Fx = 0$. Thus

    $\|T-F\|^2 \ge \|Tx-Fx\|^2 = \|Tx\|^2 = \bigl\|\sum_{i=1}^{n} \mu_i a_i \psi_i\bigr\|^2 = \sum_{i=1}^{n} \mu_i^2 |a_i|^2 \ge \mu_n^2 \sum_{i=1}^{n} |a_i|^2 = \mu_n^2$,

and $\|T-F\| \ge \mu_n(T)$. It is easily verified that equality occurs when

    $Fx = \sum_{j=1}^{n-1} \mu_j \langle x, \phi_j\rangle \psi_j$  $(x \in \mathcal{H})$.

LEMMA 3.4.3. Suppose that $R$ and $S$ are compact linear operators acting on a Hilbert space $\mathcal{H}$ and $m$, $n$ are positive integers. Then

    $\mu_{m+n-1}(RS) \le \mu_m(R)\,\mu_n(S)$.

Proof. By Lemma 3.4.2 there exist operators $R_1$ (with finite rank not greater than $m-1$) and $S_1$ (with finite rank not greater than $n-1$) such that

    $\|R-R_1\| = \mu_m(R)$,  $\|S-S_1\| = \mu_n(S)$.

The operator $F$, defined by

    $F = R_1(S-S_1) + RS_1$,

has finite rank not greater than $m+n-2$, and $RS-F = (R-R_1)(S-S_1)$. Hence, by Lemma 3.4.2,

    $\mu_{m+n-1}(RS) \le \|RS-F\| = \|(R-R_1)(S-S_1)\| \le \|R-R_1\|\,\|S-S_1\| = \mu_m(R)\,\mu_n(S)$.

The following result is of interest only when $t < 1$, since Theorem 2.3.10 gives a sharper inequality when $t \ge 1$.

LEMMA 3.4.4. Suppose that $r$, $s$, $t$ are positive real numbers such that $r^{-1}+s^{-1} = t^{-1}$, and that $R \in \mathcal{C}_r$, $S \in \mathcal{C}_s$. Then $RS \in \mathcal{C}_t$ and

    $\|RS\|_t \le 2^{1/t} \|R\|_r \|S\|_s$.

Proof. Let $a = rt^{-1}$, $b = st^{-1}$, so that $a^{-1}+b^{-1} = 1$. It follows from Lemma 3.4.3 that

    $\mu_{2n}(RS) \le \mu_{2n-1}(RS) \le \mu_n(R)\,\mu_n(S)$  $(n = 1, 2, \ldots)$.

Hence

    $\sum_{j=1}^{\infty} \mu_j(RS)^t = \sum_{n=1}^{\infty} [\mu_{2n-1}(RS)^t + \mu_{2n}(RS)^t]$
    $\le 2 \sum_{n=1}^{\infty} \mu_n(R)^t \mu_n(S)^t$
    $\le 2 \bigl[\sum_{n=1}^{\infty} \mu_n(R)^{at}\bigr]^{1/a} \bigl[\sum_{n=1}^{\infty} \mu_n(S)^{bt}\bigr]^{1/b}$
    $= 2 \bigl[\sum_{n=1}^{\infty} \mu_n(R)^{r}\bigr]^{t/r} \bigl[\sum_{n=1}^{\infty} \mu_n(S)^{s}\bigr]^{t/s}$.

Since the right-hand side is finite, $RS \in \mathcal{C}_t$; and, by taking $t$-th roots of both sides, we obtain the required inequality.

LEMMA 3.4.5. Suppose that $T \in \mathcal{C}_p$, where $p \ge 1$, and let $m$ be the smallest integer greater than $p$. Then

    $T^m \in \mathcal{C}_{p/m}$,  $\|T^m\|_{p/m} \le 2^{m/p} \|T\|_p^m$.

Proof. We first prove, by induction on $j$, that

(3)  $T^j \in \mathcal{C}_{p/j}$,  $\|T^j\|_{p/j} \le \|T\|_p^j$

for $j = 1, 2, \ldots, m-1$. When $j = 1$, there is nothing to prove. Suppose that $2 \le k < m$, and that (3) is satisfied when $j = k-1$. By Theorem 2.3.10, with $r = p/(k-1)$, $s = p$ and $t = p/k$ $(\ge 1)$,

    $T^k = T^{k-1}T \in \mathcal{C}_{p/k}$,

    $\|T^k\|_{p/k} \le \|T^{k-1}\|_{p/(k-1)} \|T\|_p \le \|T\|_p^{k-1} \|T\|_p = \|T\|_p^k$.

This completes the inductive proof of (3), when $1 \le j < m$.

We now apply (3) (with $j = m-1$) and Lemma 3.4.4 (with $r = p/(m-1)$, $s = p$, $t = p/m$) to obtain

    $T^m = T^{m-1}T \in \mathcal{C}_{p/m}$,

    $\|T^m\|_{p/m} \le 2^{m/p} \|T^{m-1}\|_{p/(m-1)} \|T\|_p \le 2^{m/p} \|T\|_p^m$.
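
Lemmas 3.4.2, 3.4.3 and 3.4.4 all have direct matrix analogues, which can be checked numerically. The sketch below is an illustration only, under the assumption that the operators are square matrices and that $\mu_j$ is computed as the $j$-th singular value; the helper names `s_numbers` and `schatten` are hypothetical and not taken from the text.

```python
import numpy as np

def s_numbers(A):
    """Singular values mu_1(A) >= mu_2(A) >= ... of a matrix A."""
    return np.linalg.svd(A, compute_uv=False)

def schatten(A, p):
    """(sum_j mu_j(A)^p)^(1/p); only a quasi-norm when 0 < p < 1."""
    return (s_numbers(A) ** p).sum() ** (1.0 / p)

rng = np.random.default_rng(3)
dim = 6
R = rng.standard_normal((dim, dim))
S = rng.standard_normal((dim, dim))

# Lemma 3.4.2: truncating the expansion (1) after n-1 terms gives an F
# of rank < n with ||R - F|| = mu_n(R).
n = 3
U, mu, Vh = np.linalg.svd(R)
F = (U[:, :n - 1] * mu[:n - 1]) @ Vh[:n - 1, :]
approx_error = np.linalg.norm(R - F, 2)          # operator norm
attains_mu_n = bool(np.isclose(approx_error, mu[n - 1]))

# Lemma 3.4.3: mu_{a+b-1}(RS) <= mu_a(R) mu_b(S), indices starting at 1.
mu_R, mu_S, mu_RS = s_numbers(R), s_numbers(S), s_numbers(R @ S)
lemma_343 = all(
    mu_RS[a + b - 2] <= mu_R[a - 1] * mu_S[b - 1] + 1e-12
    for a in range(1, dim + 1)
    for b in range(1, dim + 1)
    if a + b - 1 <= dim
)

# Lemma 3.4.4 with r = s = 1, so that t = 1/2 < 1.
r, s = 1.0, 1.0
t = 1.0 / (1.0 / r + 1.0 / s)
lemma_344 = bool(
    schatten(R @ S, t) <= 2 ** (1.0 / t) * schatten(R, r) * schatten(S, s)
)

print(attains_mu_n, lemma_343, lemma_344)
```

The choice $r = s = 1$ is deliberate: it gives $t = \tfrac12$, the range in which Theorem 2.3.10 is not available and the factor $2^{1/t}$ of Lemma 3.4.4 is the price paid.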


THEOREM 3.4.6. Suppose that $p > 0$ and $T$ is a quasi-nilpotent operator of class $\mathcal{C}_p$ acting on a Hilbert space $\mathcal{H}$. Then

(i) there exist constants $\Gamma'_p$ and $M_p$ (depending only on $p$) such that, for all complex $\lambda$,

(4)  $\|(I-\lambda T)^{-1}\| \le M_p \exp(\Gamma'_p |\lambda|^p \|T\|_p^p)$;

(ii) given any positive $\epsilon$, there is a constant $M$ (which depends on $p$, $T$ and $\epsilon$) such that, for all complex $\lambda$,

(5)  $\|(I-\lambda T)^{-1}\| \le M \exp(\epsilon|\lambda|^p)$.


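Proof. We consider separately two cases.

Case 1. $0 < p < 1$. Since $\mathcal{C}_p \subseteq \mathcal{C}_1$, $T$ is a trace class operator, and the theory developed in §3.3 is available. By Lemma 3.3.12 and Theorem 3.3.9, $d(\lambda, T) = 1$ and

(6)  $(I-\lambda T)^{-1} = I + \lambda D_{\lambda,T} = d(\lambda, T)I + \lambda D_{\lambda,T}$

for all complex $\lambda$. By (6) and 3.3(31),

    $\|(I-\lambda T)^{-1}\| \le \exp\bigl(\tfrac12 + \Gamma_p |\lambda|^p \sum_j \mu_j(T)^p\bigr) = \exp(\tfrac12) \exp(\Gamma_p |\lambda|^p \|T\|_p^p)$

for all complex $\lambda$, where $\Gamma_p$ is defined by 3.2(20) and depends only on $p$. By (6) and 3.3(35), there is a constant $N$ such that

    $\|(I-\lambda T)^{-1}\| \le N \exp(\epsilon|\lambda|^p)$

for all complex $\lambda$. This completes the proof of the theorem in case 1.

Case 2. $p \ge 1$. Let $m$ be the smallest integer greater than $p$. By Lemma 3.4.5,

(7)  $\|T^m\|_q^{1/m} \le 2^{1/p} \|T\|_p$,

where $q = p/m$. Since $0 < q < 1$ and $T^m$ is a quasi-nilpotent element of $\mathcal{C}_q$, the results established in case 1 are applicable to $T^m$. It follows that, for all complex $\lambda$,

(8)  $\|(I-\lambda^m T^m)^{-1}\| \le \exp(\tfrac12 + \Gamma_q |\lambda^m|^q \|T^m\|_q^q) \le \exp(\tfrac12 + 2\Gamma_q |\lambda|^p \|T\|_p^p)$.

Furthermore, there is a constant $N$ such that, for all complex $\lambda$,

(9)  $\|(I-\lambda^m T^m)^{-1}\| \le N \exp(\tfrac12\epsilon|\lambda^m|^q) = N \exp(\tfrac12\epsilon|\lambda|^p)$.

Since

    $I-\lambda^m T^m = (I-\lambda T)(I+\lambda T+ \ldots +\lambda^{m-1}T^{m-1})$,

we have

    $(I-\lambda T)^{-1} = (I+\lambda T+ \ldots +\lambda^{m-1}T^{m-1})(I-\lambda^m T^m)^{-1}$.

From this last identity and the fact that $\|T\| \le \|T\|_p$, it follows that

(10)  $\|(I-\lambda T)^{-1}\| \le (1+|\lambda|\,\|T\|_p+|\lambda|^2\|T\|_p^2+ \ldots +|\lambda|^{m-1}\|T\|_p^{m-1})\, \|(I-\lambda^m T^m)^{-1}\|$.

There exist constants $K_p$ (depending only on $p$) and $K$ such that, for all positive $t$,

(11)  $1+t+t^2+ \ldots +t^{m-1} \le K_p \exp(t^p)$,

(12)  $1+t\|T\|_p+t^2\|T\|_p^2+ \ldots +t^{m-1}\|T\|_p^{m-1} \le K \exp(\tfrac12\epsilon t^p)$.

From (10), (11) and (8),

    $\|(I-\lambda T)^{-1}\| \le K_p \exp(|\lambda|^p\|T\|_p^p) \exp(\tfrac12 + 2\Gamma_q|\lambda|^p\|T\|_p^p) = M_p \exp(\Gamma'_p|\lambda|^p\|T\|_p^p)$,

where $M_p = K_p \exp(\tfrac12)$ and $\Gamma'_p = 2\Gamma_q+1$. By (10), (12) and (9),

    $\|(I-\lambda T)^{-1}\| \le K \exp(\tfrac12\epsilon|\lambda|^p)\, N \exp(\tfrac12\epsilon|\lambda|^p) = M \exp(\epsilon|\lambda|^p)$,

where $M = KN$. This completes the proof of Theorem 3.4.6.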
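
The algebraic step behind Case 2 is the factorization $(I-\lambda T)^{-1} = (I+\lambda T+ \ldots +\lambda^{m-1}T^{m-1})(I-\lambda^m T^m)^{-1}$ and the resulting estimate (10). As a sanity check, the sketch below verifies both for a nilpotent matrix. This is an assumed finite-dimensional illustration; it uses the operator norm $\|T\|$ in place of $\|T\|_p$, which only makes the bound tighter since $\|T\| \le \|T\|_p$.

```python
import numpy as np

rng = np.random.default_rng(5)
T = np.triu(rng.standard_normal((5, 5)), k=1)    # strictly upper triangular, so T^5 = 0
lam, m = 1.3 - 0.7j, 3
I = np.eye(5)

# I + lam*T + ... + lam^(m-1) * T^(m-1)
geometric = sum(lam ** k * np.linalg.matrix_power(T, k) for k in range(m))
inner = np.linalg.inv(I - lam ** m * np.linalg.matrix_power(T, m))

resolvent = np.linalg.inv(I - lam * T)
identity_gap = np.linalg.norm(resolvent - geometric @ inner, 2)

# Right-hand side of (10), with the operator norm of T
norm_T = np.linalg.norm(T, 2)
bound10 = sum(abs(lam) ** k * norm_T ** k for k in range(m)) * np.linalg.norm(inner, 2)
resolvent_norm = np.linalg.norm(resolvent, 2)

print(identity_gap < 1e-10, resolvent_norm <= bound10)
```

The point of the factorization is that $T^m$ lies in a class $\mathcal{C}_q$ with $q < 1$, where the determinant theory of §3.3 applies, while the polynomial factor grows more slowly than any $\exp(\epsilon t^p)$.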


Theorem 3.4.6(ii) shows that, if $\Phi(\lambda) = (I-\lambda T)^{-1}$ for some quasi-nilpotent operator $T$ of class $\mathcal{C}_p$, where $0 < p < \infty$, then the operator-valued entire function $\Phi$ has finite order at most $p$, and is of minimum type if its order is exactly $p$.


3.5. Some applications 

Suppose that $T$ is a compact linear operator acting on a Hilbert space $\mathcal{H}$, and let $\mathcal{N}_T$ be the null space of $T$ and $\mathcal{R}_T$ the closed range space of $T$; that is, $\mathcal{N}_T = \{x \in \mathcal{H} : Tx = 0\}$, while $\mathcal{R}_T$ is the closure of $\{Tx : x \in \mathcal{H}\}$. If $x$ is a principal vector of $T$, associated with a non-zero eigenvalue $\mu$, then $(T-\mu I)^n x = 0$ for some positive integer $n$, and from this it follows that $x \in \mathcal{R}_T$. Hence

(1)  $\mathcal{P}_T \subseteq \mathcal{R}_T$,

where $\mathcal{P}_T$ denotes the closed subspace generated by the set of all principal vectors of $T$ associated with non-zero eigenvalues. In this section, we are interested in finding conditions under which $\mathcal{P}_T = \mathcal{R}_T$; that is, conditions under which, for each $x$ in $\mathcal{H}$, $Tx$ can be approximated by linear combinations of principal vectors.

Since $\mathcal{R}_T^\perp = \mathcal{N}_{T^*}$, it follows from (1) that

(2)  $\mathcal{N}_{T^*} \subseteq \mathcal{P}_T^\perp$,

with equality if and only if $\mathcal{P}_T = \mathcal{R}_T$.

From the foregoing discussion, we see that $\mathcal{P}_T = \mathcal{R}_T$ if and only if $T^*x = 0$ for each $x$ in $\mathcal{P}_T^\perp$. If $T$ satisfies the stronger condition, $Tx = T^*x = 0$ for each $x$ in $\mathcal{P}_T^\perp$, then $\mathcal{P}_T = \mathcal{R}_T$ and, in addition, the closed subspace generated by the principal vectors (associated with non-zero eigenvalues) and null space of $T$ is the whole of $\mathcal{H}$.

We recall from Theorem 1.8.5 that $\mathcal{P}_T^\perp$ is invariant under $T^*$, and the restriction $N$ of $T^*$ to $\mathcal{P}_T^\perp$ is quasi-nilpotent. If $1 \le p < \infty$ and $T \in \mathcal{C}_p$, then the self-adjoint part $A = \tfrac12(T+T^*)$, the skew-adjoint part $B = \tfrac12 i(T^*-T)$ and the adjoint $T^*$ of $T$ all lie in $\mathcal{C}_p$; since an orthonormal system in $\mathcal{P}_T^\perp$ is also an orthonormal system in $\mathcal{H}$, it follows at once from Definition 2.1.1 that $N \in \mathcal{C}_p$.


The following result is due to V. B. Lidskii [39].

THEOREM 3.5.1. Suppose that $T$ is a trace class operator acting on a Hilbert space $\mathcal{H}$, and $\tfrac12 i(T^*-T) \ge 0$. Then $\mathcal{P}_T = \mathcal{R}_T$, and the closed subspace generated by the principal vectors (associated with non-zero eigenvalues) and null space of $T$ is the whole of $\mathcal{H}$.

Proof. By the preceding discussion, it is sufficient to show that $Tx = T^*x = 0$ whenever $x \in \mathcal{P}_T^\perp$. If $A = \tfrac12(T+T^*)$ and $B = \tfrac12 i(T^*-T)$, then $A$ and $B$ are self-adjoint trace class operators, $B \ge 0$, $T = A+iB$ and $T^* = A-iB$. The restriction $N$ of $T^*$ to $\mathcal{P}_T^\perp$ is a quasi-nilpotent trace class operator and, by Lemma 3.3.12, $\tau(N) = 0$. If $(\psi_j)$ is an orthonormal basis of $\mathcal{P}_T^\perp$, then

    $0 = \tau(N) = \sum_j \langle N\psi_j, \psi_j\rangle = \sum_j \langle T^*\psi_j, \psi_j\rangle = \sum_j \{\langle A\psi_j, \psi_j\rangle - i\langle B\psi_j, \psi_j\rangle\}$.

Since $B \ge 0$, it follows by taking imaginary parts that $\langle B\psi_j, \psi_j\rangle = 0$ for each $j$. Thus $B\psi_j = 0$ and, since linear combinations of the $\psi_j$ are dense in $\mathcal{P}_T^\perp$,

(3)  $Bx = 0$  $(x \in \mathcal{P}_T^\perp)$.

Hence

(4)  $Ax = (A-iB)x = T^*x = Nx$  $(x \in \mathcal{P}_T^\perp)$,

and $\langle Nx, y\rangle = \langle Ax, y\rangle = \langle x, Ay\rangle = \langle x, Ny\rangle$ whenever $x, y \in \mathcal{P}_T^\perp$. It follows that the quasi-nilpotent operator $N$ is also self-adjoint; Corollary 1.7.5 shows that $N = 0$ and, by (4),

(5)  $Ax = 0$  $(x \in \mathcal{P}_T^\perp)$.

By (3) and (5), $Tx = T^*x = 0$ for each $x$ in $\mathcal{P}_T^\perp$.


The following theorem is due to V. B. Lidskii [37, 38] and L. A. Sahnovic [56].

THEOREM 3.5.2. Suppose that $T \in \mathcal{C}_2$, $\tfrac12(T+T^*) \ge 0$ and $\tfrac12 i(T^*-T) \ge 0$. Then $\mathcal{P}_T = \mathcal{R}_T$, and the closed subspace generated by the principal vectors (associated with non-zero eigenvalues) and null space of $T$ is the whole of the Hilbert space $\mathcal{H}$ on which $T$ acts.

Proof. Once again, it is sufficient to prove that $Tx = T^*x = 0$ whenever $x \in \mathcal{P}_T^\perp$. The restriction $N$ of $T^*$ to $\mathcal{P}_T^\perp$ is a quasi-nilpotent operator of class $\mathcal{C}_2$, so $N^2$ is a quasi-nilpotent trace class operator and, by Lemma 3.3.12,

(6)  $\tau(N^2) = 0$.

Let $N = H-iK$, where $H$ and $K$ are self-adjoint operators on $\mathcal{P}_T^\perp$. Then $H$ and $K$ are of class $\mathcal{C}_2$, since $N \in \mathcal{C}_2$. For each $x$ in $\mathcal{P}_T^\perp$,

    $\langle Hx, x\rangle - i\langle Kx, x\rangle = \langle Nx, x\rangle = \langle T^*x, x\rangle = \langle Ax, x\rangle - i\langle Bx, x\rangle$,

where $A = \tfrac12(T+T^*)$, $B = \tfrac12 i(T^*-T)$. Since $A$ and $B$ are positive operators, so are $H$ and $K$. By (6),

    $0 = \tau(N^2) = \tau(H^2-K^2) - i\tau(HK+KH)$,

and, since $\tau(HK) = \tau(KH) = \tau((HK)^*) = \overline{\tau(HK)}$ by Theorem 2.3.13, it follows that

(7)  $\tau(HK) = 0$.


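We may suppose that

(8)  $Kx = \sum_j \kappa_j \langle x, \phi_j\rangle \phi_j$  $(x \in \mathcal{P}_T^\perp)$,

where $(\phi_j)$ is an orthonormal sequence in $\mathcal{P}_T^\perp$ and $(\kappa_j)$ is a decreasing sequence of positive real numbers. Let $(\psi_k)$ be another orthonormal system, such that $\{\phi_j\} \cup \{\psi_k\}$ is an orthonormal basis of $\mathcal{P}_T^\perp$. By (8), $K\psi_k = 0$ for each $k$; so

    $0 = \tau(HK) = \sum_j \langle HK\phi_j, \phi_j\rangle + \sum_k \langle HK\psi_k, \psi_k\rangle = \sum_j \kappa_j \langle H\phi_j, \phi_j\rangle$.

Since $\kappa_j > 0$ and $H \ge 0$, it follows that $H\phi_j = 0$ for each $j$. By (8), $HK = 0$; thus $KH = (HK)^* = 0 = HK$, and $N$ $(= H-iK)$ is both normal and (as noted above) quasi-nilpotent. By Corollary 1.7.5, $N = 0$. For each $x$ in $\mathcal{P}_T^\perp$,

    $0 = \langle Nx, x\rangle = \langle Ax, x\rangle - i\langle Bx, x\rangle$.

Since $A$ and $B$ are positive operators, it follows that $Ax = Bx = 0$, so $Tx = T^*x = 0$, and the theorem is proved.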


CHAPTER 4 
Superdiagonal Representation of Compact Linear Operators 


4.1. Introduction 

An $n \times n$ matrix $A = [a_{jk}]$ is said to be superdiagonal if $a_{jk} = 0$ whenever $j > k$; the eigenvalues of such a matrix are its diagonal elements $a_{11}, a_{22}, \ldots, a_{nn}$. An elementary theorem (see, for example, [26: p. 144]) asserts that, if $B$ is an $n \times n$ complex matrix, then there is a unitary matrix $U$ such that $U^{-1}BU$ is superdiagonal. This result can be reformulated in the following way [26: pp. 107, 144]: if $T$ is a linear operator acting on an $n$-dimensional Hilbert space $\mathcal{H}$, then there exist subspaces $L_0, L_1, \ldots, L_n$ of $\mathcal{H}$ such that

(i) $\{0\} = L_0 \subset L_1 \subset \ldots \subset L_n = \mathcal{H}$,
(ii) $L_j$ is $j$-dimensional $(j = 0, 1, \ldots, n)$,
(iii) each $L_j$ is invariant under $T$.

If, for each $j = 1, \ldots, n$, $e_j$ is a unit vector in $L_j \cap L_{j-1}^\perp$, then $e_1, \ldots, e_n$ is an orthonormal basis of $\mathcal{H}$, with respect to which $T$ has a superdiagonal matrix $A$. The eigenvalues of the operator $T$, which coincide with those of the matrix $A$, are therefore precisely the diagonal elements of $A$. It follows easily that the eigenvalues of $A$, counted according to their algebraic multiplicities, are the scalars $\lambda_1, \ldots, \lambda_n$ determined by the condition $Te_j - \lambda_j e_j \in L_{j-1}$ $(j = 1, \ldots, n)$. Since $e_j \perp L_{j-1}$, we deduce that

    $0 = \langle Te_j - \lambda_j e_j, e_j\rangle = \langle Te_j, e_j\rangle - \lambda_j$,

that is, $\lambda_j = \langle Te_j, e_j\rangle$.

The operator $T$ is nilpotent if and only if $\lambda_j = 0$ $(j = 1, \ldots, n)$, and this is equivalent to the assertion that $Te_j \in L_{j-1}$ $(j = 1, \ldots, n)$. Since $L_j$ is the linear span of $e_j$ and $L_{j-1}$, and $T(L_{j-1}) \subseteq L_{j-1}$, $T$ is nilpotent if and only if

    $T(L_j) \subseteq L_{j-1}$  $(j = 1, \ldots, n)$.

The preceding discussion suggests that a family $\{L_j : j = 1, \ldots, n\}$ of subspaces, which has properties (i), (ii) and (iii) above, determines a 'superdiagonal representation' of $T$. In the present chapter, we consider the application of this idea to compact linear operators acting on infinite-dimensional spaces. In §4.2, we prove the theorem, due to Aronszajn and Smith [1], that a compact linear operator $T$ acting on a complex Banach space $X$ has a proper closed invariant subspace (this result has recently been extended in a number of ways; see [2, 3, 4, 13, 17, 27, 28]). In §4.3 the Aronszajn-Smith theorem is used, with a simple Zorn's Lemma argument, to prove the existence of a maximal totally ordered family $\mathcal{F}$ of closed subspaces of $X$, each of which is invariant under $T$. It is then possible to obtain satisfactory analogues of the finite-dimensional results, described above, concerning the eigenvalues of $T$ and the conditions under which $T$ is quasi-nilpotent.

In §4.4 we consider the superdiagonal representation of compact linear operators acting on Hilbert spaces, together with the following question: if $\mathcal{F}$ is a maximal totally ordered family of closed subspaces of a Hilbert space $\mathcal{H}$ and $S$ is a compact self-adjoint operator on $\mathcal{H}$, is there a quasi-nilpotent operator $T$ on $\mathcal{H}$ which has skew-adjoint part $S$ and leaves invariant each subspace in $\mathcal{F}$? In order to clarify the nature of this question, we now consider the problem in finite-dimensional spaces. In this case, we can suppose that the family $\mathcal{F}$ consists of subspaces $L_0, L_1, \ldots, L_n$ satisfying conditions (i) and (ii) above, and we can choose an orthonormal basis $e_1, \ldots, e_n$ of $\mathcal{H}$ such that $e_j \in L_j \cap L_{j-1}^\perp$ $(j = 1, \ldots, n)$. The given operator $S$ has a hermitian matrix $[s_{jk}]$ with respect to this basis. If $T$ is a nilpotent operator acting on $\mathcal{H}$ and leaves invariant each $L_j$, its matrix $[t_{jk}]$ with respect to the given orthonormal basis is both superdiagonal and nilpotent, and therefore satisfies

(1)  $t_{jk} = 0$  $(1 \le k \le j \le n)$.

The required condition, that $S = \tfrac12 i(T^*-T)$, is equivalent to

(2)  $s_{jk} = -\tfrac12 i\, t_{jk}$ $(j < k)$,  $s_{jk} = 0$ $(j = k)$,  $s_{jk} = \tfrac12 i\, \bar t_{kj}$ $(j > k)$.

Since $[s_{jk}]$ is a hermitian matrix, a solution $[t_{jk}]$ of (1) and (2) exists if and only if

(3)  $s_{jj} = 0$  $(j = 1, \ldots, n)$;

the solution is then unique, and is given by

(4)  $t_{jk} = 2i\, s_{jk}$ $(j < k)$,  $t_{jk} = 0$ $(j \ge k)$.

It is not difficult to show that, if $E_j$ denotes the projection from $\mathcal{H}$ onto $L_j$, then (3) and (4) can be rewritten in the form

(3')  $(E_j-E_{j-1})S(E_j-E_{j-1}) = 0$  $(j = 1, \ldots, n)$,

(4')  $T = 2i \sum_{j=1}^{n} E_{j-1} S (E_j-E_{j-1})$.

Our conclusion can therefore be stated as follows: the condition (3') is both necessary and sufficient for the existence of a nilpotent operator $T$ on $\mathcal{H}$ which has skew-adjoint part $S$ and leaves each of the subspaces $L_j$ invariant; when (3') is satisfied, the operator $T$ is unique, and is given by (4').
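
The finite-dimensional conclusion can be checked directly. The sketch below builds $T$ from formula (4') for a hermitian matrix $S$ with zero diagonal, then confirms that $T$ is superdiagonal, nilpotent, and has skew-adjoint part $\tfrac12 i(T^*-T) = S$. It assumes the standard basis of $\mathbb{C}^n$ plays the role of $e_1, \ldots, e_n$, so that $E_j$ is the projection onto the first $j$ coordinates; the function name `superdiagonal_part` is illustrative only.

```python
import numpy as np

def superdiagonal_part(S):
    """T = 2i * sum_j E_{j-1} S (E_j - E_{j-1}), as in (4').

    E_j is the projection onto the span of the first j standard basis vectors.
    """
    n = S.shape[0]
    T = np.zeros((n, n), dtype=complex)
    for j in range(1, n + 1):
        E_prev = np.diag([1.0] * (j - 1) + [0.0] * (n - j + 1))
        E_j = np.diag([1.0] * j + [0.0] * (n - j))
        T += 2j * E_prev @ S @ (E_j - E_prev)
    return T

rng = np.random.default_rng(6)
A = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
S = A + A.conj().T                  # hermitian
np.fill_diagonal(S, 0.0)            # condition (3): zero diagonal

T = superdiagonal_part(S)
skew_part = 0.5j * (T.conj().T - T)

print(np.allclose(skew_part, S),                        # skew-adjoint part is S
      np.allclose(T, 2j * np.triu(S, k=1)),             # agrees with (4)
      np.allclose(np.linalg.matrix_power(T, 4), 0))     # T is nilpotent
```

If the diagonal of $S$ is not zero, the skew-adjoint part of the matrix produced by (4') differs from $S$ exactly on the diagonal, which is why (3) is necessary.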

In §4.4 we consider the problem, in the form set out at the beginning of the preceding paragraph, in the infinite-dimensional case. It turns out that there is a simple condition on $S$, analogous to (3'), which is necessary for the existence of an operator $T$ with the requisite properties. Unfortunately, this condition is not sufficient; it can be strengthened, so as to become a sufficient condition, in various ways (for example, by requiring $S$ to lie in a von Neumann-Schatten class $\mathcal{C}_p$, $p < \infty$). No simple necessary and sufficient condition for the existence of a suitable operator $T$ is known. Given a general compact self-adjoint operator $S$, it is not difficult to show that, if such an operator $T$ exists, then it is unique and is compact; furthermore, $T$ can be expressed, in terms of $S$ and $\mathcal{F}$, as a 'superdiagonal integral' analogous to the sum occurring in (4'). In view of equation (4) it is reasonable to regard $T$, when it exists, as the 'superdiagonal part' of $S$ relative to the given totally ordered family $\mathcal{F}$ of subspaces of $\mathcal{H}$.

In §§4.5 and 4.6 we consider the integral operator $V$, acting on the space $L_2(0,1)$ (Lebesgue measure), which is defined by


(5) (Vx)(s) = if x(Ode (x € L(0,1) : O< 8 <D. 


If $0 \le \lambda \le 1$, then the closed subspace

(6)  $L_\lambda = \{x \in L_2(0,1) : x(s) = 0$ almost everywhere on $[0, \lambda]\}$

of $L_2(0,1)$ is invariant under $V$. It turns out that there are three simple properties of $V$ which characterize this operator up to unitary equivalence (that is, any operator with these properties is unitarily equivalent to $V$). Furthermore, the subspaces $L_\lambda$ $(0 \le \lambda \le 1)$ are the only closed invariant subspaces of $V$. Thus $V$ is an example of a unicellular operator; that is, an operator whose closed invariant subspaces are totally ordered by the inclusion relation $\subseteq$. In §4.5 we prove these results by using the theory of superdiagonal integrals developed in §4.4; an alternative, more elementary, treatment is given in §4.6.

The results described in §4.4 were originally obtained by various Russian mathematicians, notably Brodskii [6], Gohberg and Krein [18, 19, 20] and Macaev [44, 45]. The characterization (up to unitary equivalence) of the operator $V$ is a special case of more general results due to Livsic [42]. The unicellularity of $V$ was first proved by Dixmier [14]; subsequently, a number of proofs have been given [5, 15, 30, 8]. The treatment given in §4.5 is based on [8], and I am indebted to A. L. Brown for pointing out that the method used in [8] to establish unicellularity can easily be adapted to obtain the characterization of $V$ up to unitary equivalence. The methods used in §4.6 are derived from [5] and [30, 31]. The unicellularity of certain other Volterra integral operators has been established by Kalisch [29].


4.2. Invariant subspaces of compact linear operators 


Our purpose in this section is to prove the following result. 


THEOREM 4.2.1. If T is a compact linear operator acting on a 
complex Banach space X of dimension greater than 1, then T has a 


proper closed invariant subspace. 


The original proof of this result [1] has subsequently been simplified a little; the version given below is derived from [16]. Before embarking on the proof, we require some further notation and a number of auxiliary results.

Throughout §4.2, $T$ is a compact linear operator acting on a complex Banach space $X$. In the case of finite-dimensional Banach spaces, the result of Theorem 4.2.1 is contained in the elementary linear algebra discussed at the beginning of §4.1; we may therefore assume that $X$ is infinite-dimensional. The closed subspace of $X$ generated by a (finite or countable) subset $\{x_1, x_2, \ldots\}$ will be denoted by $[x_1, x_2, \ldots]$. Once and for all, we choose a unit vector $e$ in $X$. The closed subspace $X_\infty = [e, Te, T^2e, \ldots]$ is invariant under $T$, and is not $\{0\}$. If $X_\infty \neq X$, then $X_\infty$ is a proper closed invariant subspace of $T$; we may therefore assume that $X_\infty = X$. Let

(1)  $X_n = [e, Te, \ldots, T^{n-1}e]$  $(n = 1, 2, \ldots)$.

If the vectors $e, Te, T^2e, \ldots$ are linearly dependent, there is a positive integer $m$ such that $T^m e \in X_m$; from this, it follows easily that $T^n e \in X_m$ $(n = 0, 1, 2, \ldots)$, and $X_m$ coincides with $X_\infty$ $(= X)$, contrary to our supposition that $X$ is infinite-dimensional. Hence the vectors $e, Te, T^2e, \ldots$ are linearly independent, and the subspace $X_n$ is $n$-dimensional. In view of the preceding discussion, we shall assume in future that

(2)  the vectors $e, Te, T^2e, \ldots$ are linearly independent,
(3)  $X = [e, Te, T^2e, \ldots]$.


If $Z$ is a closed subspace of $X$, and $x \in X$, we write $d(x, Z)$ for the distance $\inf\{\|x-z\| : z \in Z\}$ of $x$ from $Z$. It is apparent that $d(x, Z) \le \|x\|$ and that, for a given subspace $Z$, the mapping $x \to d(x, Z)$ is a seminorm on $X$. It follows that, if $(Z_k)$ is a sequence of closed subspaces of $X$, then the mapping

    $x \to \limsup_{k\to\infty} d(x, Z_k)$

is a seminorm on $X$, majorized by $\|\ \|$. This implies that the subset $\liminf Z_k$, defined by

(4)  $\liminf Z_k = \{x \in X : \limsup_{k\to\infty} d(x, Z_k) = 0\}$,

is a closed subspace of $X$.

If $x \in X$ and $\epsilon > 0$, it follows from (3) that there is a positive integer $m$ and a linear combination $z$ of $e, Te, \ldots, T^{m-1}e$ such that $\|x-z\| < \epsilon$; thus $z \in X_m$ and

    $d(x, X_n) \le \|x-z\| < \epsilon$

whenever $n \ge m$. Hence

(5)  $\limsup_{n\to\infty} d(x, X_n) = 0$ for each $x$ in $X$;

equivalently, $\liminf X_n = X$.


For each positive integer $n$, let $x_n$ be a vector satisfying

(6)  $x_n \in X_n$,  $\|T^n e - x_n\| \le 2d(T^n e, X_n)$.

There is a unique linear operator $T_n$, acting on the space $X_n$, such that

    $T_n(T^k e) = T^{k+1}e$  $(k = 0, 1, \ldots, n-2)$,
    $T_n(T^{n-1}e) = x_n$.

For each element $x = a_0 e + a_1 Te + \ldots + a_{n-1}T^{n-1}e$ of $X_n$,

    $\|Tx - T_n x\| = \bigl\|\sum_{k=0}^{n-1} a_k [T^{k+1}e - T_n(T^k e)]\bigr\|$
    $= \|a_{n-1}(T^n e - x_n)\|$
    $= |a_{n-1}|\, \|T^n e - x_n\|$
    $\le 2|a_{n-1}|\, d(T^n e, X_n)$
    $= 2d(a_{n-1}T^n e, X_n)$.

Since

    $Tx - a_{n-1}T^n e = \sum_{k=0}^{n-2} a_k T^{k+1}e \in X_n$,

it follows that $d(a_{n-1}T^n e, X_n) = d(Tx, X_n)$; thus

(7)  $\|Tx - T_n x\| \le 2d(Tx, X_n)$  $(x \in X_n)$.

Since $T_n$ is a linear operator acting on the $n$-dimensional vector space $X_n$, it follows from the elementary linear algebra discussed at the beginning of §4.1 that there exist subspaces $L_{m,n}$ $(0 \le m \le n)$ of $X_n$ such that

(8)  $\{0\} = L_{0,n} \subset L_{1,n} \subset \ldots \subset L_{n,n} = X_n$,
(9)  $L_{m,n}$ is $m$-dimensional,
(10)  $L_{m,n}$ is invariant under $T_n$.

The following lemma shows how invariant subspaces of $T$ may be constructed from suitable sequences of the subspaces $L_{m,n}$.


LEMMA 4.2.2. Suppose that $(m(k))_{1 \le k < \infty}$ and $(n(k))_{1 \le k < \infty}$ are sequences of integers such that $0 \le m(k) \le n(k) < n(k+1)$ $(k = 1, 2, \ldots)$, and $L$ is the closed subspace $\liminf L_{m(k),n(k)}$ of $X$.

(i) If $z \in X$, $x_k \in L_{m(k),n(k)}$ $(k = 1, 2, \ldots)$ and $\|z - Tx_k\| \to 0$ as $k \to \infty$, then $z \in L$.

(ii) $L$ is invariant under $T$.




Proof. Suppose that $z$ and $x_k$ satisfy the conditions set out in (i). Since $L_{m(k),n(k)}$ is invariant under $T_{n(k)}$, it follows that

    $T_{n(k)}x_k \in L_{m(k),n(k)}$;

thus

    $d(z, L_{m(k),n(k)}) \le \|z - T_{n(k)}x_k\| \le \|z - Tx_k\| + \|Tx_k - T_{n(k)}x_k\|$.

This, together with (7), gives

    $d(z, L_{m(k),n(k)}) \le \|z - Tx_k\| + 2d(Tx_k, X_{n(k)})$
    $\le \|z - Tx_k\| + 2d(Tx_k - z, X_{n(k)}) + 2d(z, X_{n(k)})$
    $\le 3\|z - Tx_k\| + 2d(z, X_{n(k)})$
    $\to 0$

as $k \to \infty$, by (5). Hence $z \in \liminf L_{m(k),n(k)} = L$, and (i) is proved.

If $x \in L$ then $d(x, L_{m(k),n(k)}) \to 0$ as $k \to \infty$. We may choose $x_k$ in $L_{m(k),n(k)}$ such that $\|x - x_k\| \le 2d(x, L_{m(k),n(k)})$, and then

    $\|Tx - Tx_k\| \le 2\|T\|\, d(x, L_{m(k),n(k)}) \to 0$.

Part (i) of the lemma, with $Tx$ in place of $z$, implies that $Tx \in L$. This proves (ii).


The remainder of this section is devoted to showing that the sequences $(m(k))$ and $(n(k))$ can be chosen so that the subspace $L$ occurring in Lemma 4.2.2 is neither $\{0\}$ nor $X$. Suppose that $\alpha$ is a real number satisfying

(11)  $0 < \alpha < \|Te\|/\|T\|$.

Clearly $\alpha < \|e\| = 1$; furthermore, $e \in X_n = L_{n,n}$ and, by (8),

    $1 = \|e\| = d(e, L_{0,n}) \ge d(e, L_{1,n}) \ge \ldots \ge d(e, L_{n,n}) = 0$.

It follows that, for each $n = 1, 2, 3, \ldots$, there is an integer $m(n)$ such that

(12)  $0 \le m(n) < n$,  $d(e, L_{m(n),n}) > \alpha \ge d(e, L_{m(n)+1,n})$.

LEMMA 4.2.3. If (n(k)) is a strictly increasing sequence of positive integers, then $\liminf L_{m(n(k)),n(k)} \ne X$.

Proof. Since $d(e, L_{m(n(k)),n(k)}) \ge \alpha > 0$ (k = 1, 2, ...), by (12), it follows that $e \notin \liminf L_{m(n(k)),n(k)}$.

If there is a strictly increasing sequence (n(k)) of positive integers such that $\liminf L_{m(n(k)),n(k)} \ne \{0\}$, it follows from Lemmas 4.2.3 and 4.2.2 that this subspace is not the whole of X, and is therefore a proper closed invariant subspace of T. It remains to deal with the case in which no such sequence (n(k)) exists.


LEMMA 4.2.4. Suppose that $\liminf L_{m(n(k)),n(k)} = \{0\}$ whenever (n(k)) is a strictly increasing sequence of positive integers. Suppose also that (n(k)) is a strictly increasing sequence of positive integers, and $(x_k)$ is a bounded sequence of vectors such that $x_k \in L_{m(n(k)),n(k)}$ (k = 1, 2, 3, ...). Then

$\|Tx_k\| \to 0$ as $k \to \infty$.

Proof. Suppose the contrary. By passing to a subsequence of (n(k)), we may suppose that there is a positive real number $\delta$ such that $\|Tx_k\| \ge \delta$ (k = 1, 2, ...). Since T is compact, we may assume also that $(Tx_k)$ converges to an element z of X. Then $\|z\| \ge \delta$ and, by Lemma 4.2.2, $z \in \liminf L_{m(n(k)),n(k)}$. So $\liminf L_{m(n(k)),n(k)} \ne \{0\}$, contradicting the hypothesis of the lemma.




Proof of Theorem 4.2.1. Since the logical structure of the proof is rather complicated, we begin with a résumé of our progress so far. We have seen that it is sufficient to consider the case in which (2) and (3) are satisfied. In this case, we have defined subspaces $X_n$ (n = 1, 2, ...) by (1), and have introduced the operators $T_n$ and subspaces $L_{m,n}$ $(0 \le m \le n;\ n = 1, 2, \dots)$ satisfying (8), (9) and (10). In the discussion preceding Lemma 4.2.4 we have shown that, if there is any strictly increasing sequence (n(k)) of positive integers such that $\liminf L_{m(n(k)),n(k)} \ne \{0\}$, then this last subspace is a proper closed invariant subspace of T. Henceforth we consider the case in which no such sequence (n(k)) exists; in this case, the hypothesis of Lemma 4.2.4 is satisfied, and so the conclusion of that lemma is available.

It follows from (12) that, for each positive integer n, there is a vector $x_n$ such that

(13) $x_n \in L_{m(n)+1,n}, \quad \|e - x_n\| < \alpha$.

Since T is compact and the sequence $(x_n)$ is bounded, there is a strictly increasing sequence (n(k)) of positive integers such that $(Tx_{n(k)})$ converges to an element z of X. By Lemma 4.2.2(ii), the closed subspace $L = \liminf L_{m(n(k))+1,n(k)}$ of X is invariant under T; and, by part (i) of that lemma, $z \in L$. Furthermore, it follows from (13) and (11) that

$\|Te - z\| = \lim \|Te - Tx_{n(k)}\|$
$\le \limsup \|T\|\,\|e - x_{n(k)}\|$
$\le \alpha\|T\|$
$< \|Te\|$.


Thus $z \ne 0$, and so $L \ne \{0\}$; it remains to prove that $L \ne X$.

Let $u_k$ be a vector in $L_{m(n(k))+1,n(k)}$ such that $d(u_k, L_{m(n(k)),n(k)}) = 1$. If we choose a suitable vector $v_k$ in $L_{m(n(k)),n(k)}$ and replace $u_k$ by $u_k - v_k$, we may assume that

(14) $d(u_k, L_{m(n(k)),n(k)}) = 1, \quad \|u_k\| \le 2$.

It follows from (8) and (9) that every element of $L_{m(n(k))+1,n(k)}$ can be expressed (uniquely) in the form $au_k + x$, where a is a complex number and $x \in L_{m(n(k)),n(k)}$. By (14),

(15) $\|au_k + x\| \ge d(au_k, L_{m(n(k)),n(k)}) = |a|$

for each scalar a and each x in $L_{m(n(k)),n(k)}$.

Suppose that L = X; we shall obtain a contradiction. Since $e, Te \in L = \liminf L_{m(n(k))+1,n(k)}$, there exist sequences $(a_k)$ and $(b_k)$ such that

(16) $a_k, b_k \in L_{m(n(k))+1,n(k)}$

and $\|a_k - e\| \to 0$, $\|b_k - Te\| \to 0$ as $k \to \infty$. We can choose scalars $\alpha_k$, $\beta_k$ and vectors $y_k$, $z_k$ such that

(17) $y_k, z_k \in L_{m(n(k)),n(k)}, \quad a_k = \alpha_k u_k + y_k, \quad b_k = \beta_k u_k + z_k$.

Since the sequences $(a_k)$ and $(b_k)$ are bounded, it follows from (15), (17) and (14) that the same is true of $(\alpha_k)$, $(\beta_k)$, hence also of $(y_k)$ and $(z_k)$. By Lemma 4.2.4, the sequences $(Ty_k)$ and $(Tz_k)$ converge to 0, and so

(18) $Te = \lim Ta_k = \lim (\alpha_k Tu_k + Ty_k) = \lim \alpha_k Tu_k$,

and similarly $T^2e = \lim \beta_k Tu_k$. By (18) and (14),

$0 < \|Te\| = \lim |\alpha_k|\,\|Tu_k\| \le 2\|T\| \liminf |\alpha_k|$,

so $\liminf |\alpha_k| > 0$. It follows that the sequence $(\beta_k\alpha_k^{-1})$ is ultimately bounded, and so has a subsequence converging to a scalar $\lambda$. As $k \to \infty$ through the appropriate values, we have

$T^2e = \lim \beta_k Tu_k = \lim (\beta_k\alpha_k^{-1})\alpha_k Tu_k = \lambda Te$,

contradicting (2). Hence $L \ne X$, and the proof of Theorem 4.2.1 is complete.


4.3. Superdiagonal representations of compact linear operators

Throughout this section, T denotes a compact linear operator acting on a complex Banach space X. The set $\mathcal{L}$ of all closed subspaces of X is partially ordered by the inclusion relation $\subseteq$, and a totally ordered subset $\mathcal{F}$ of $\mathcal{L}$ will be called a chain (more precisely, a chain of closed subspaces of X). If each subspace in a chain $\mathcal{F}$ is invariant under T, we shall describe $\mathcal{F}$ as an invariant chain. A trivial example of an invariant chain is the family consisting of the two subspaces {0}, X; the existence of non-trivial invariant chains can be deduced from Theorem 4.2.1.

The class $\mathcal{C}$ of all chains is itself partially ordered by the inclusion relation on subsets of $\mathcal{L}$. Suppose that $\mathcal{C}_0$ is a totally ordered subset of $\mathcal{C}$, and let $\mathcal{F}_0 = \bigcup\{\mathcal{F} : \mathcal{F} \in \mathcal{C}_0\}$. If $L_1, L_2 \in \mathcal{F}_0$ then there exist $\mathcal{F}_1$, $\mathcal{F}_2$ in $\mathcal{C}_0$ such that $L_1 \in \mathcal{F}_1$ and $L_2 \in \mathcal{F}_2$. Since $\mathcal{C}_0$ is totally ordered, we may suppose that $\mathcal{F}_1 \subseteq \mathcal{F}_2$; hence $L_1, L_2 \in \mathcal{F}_2$ and, since $\mathcal{F}_2$ is totally ordered, one of the subspaces $L_1$, $L_2$ contains the other. Thus $\mathcal{F}_0$ is a chain, and is clearly the least upper bound of $\mathcal{C}_0$ in $\mathcal{C}$. Since each totally ordered subset of $\mathcal{C}$ has a least upper bound, it follows from Zorn's Lemma that $\mathcal{C}$ contains maximal elements, which we call maximal chains.

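If $\mathcal{G}$ is a chain and $\mathcal{C}(\mathcal{G})$ denotes the set of all chains which contain $\mathcal{G}$, then the argument of the preceding paragraph can be applied, with $\mathcal{C}(\mathcal{G})$ in place of $\mathcal{C}$, to prove that $\mathcal{C}(\mathcal{G})$ contains a maximal element $\mathcal{G}_0$. In fact, $\mathcal{G}_0$ is maximal in the class $\mathcal{C}$ of all chains; for if $\mathcal{F} \in \mathcal{C}$ and $\mathcal{G}_0 \subseteq \mathcal{F}$ then $\mathcal{G} \subseteq \mathcal{G}_0 \subseteq \mathcal{F}$; thus $\mathcal{F} \in \mathcal{C}(\mathcal{G})$, and the maximality of $\mathcal{G}_0$ in $\mathcal{C}(\mathcal{G})$ implies that $\mathcal{F} = \mathcal{G}_0$. This shows that every chain $\mathcal{G}$ is contained in at least one maximal chain $\mathcal{G}_0$.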


Similar reasoning shows that the class $\mathcal{C}_T$ of all invariant chains contains maximal elements, which we call maximal invariant chains, and that each invariant chain is contained in at least one maximal invariant chain. It is not immediately obvious that maximal elements of $\mathcal{C}_T$ $(\subseteq \mathcal{C})$ are maximal in $\mathcal{C}$; that is, it is not at once apparent that maximal invariant chains are necessarily maximal chains. However, we prove in Lemma 4.3.3 that this is so.

The norm closure of a subset S of X will be denoted by $\operatorname{cl}(S)$. Given a subfamily $\mathcal{F}_0$ of a chain $\mathcal{F}$, the set $\bigcap\{L : L \in \mathcal{F}_0\}$ is a closed subspace of X; the same is true of $\operatorname{cl}[\bigcup\{L : L \in \mathcal{F}_0\}]$, since $\mathcal{F}_0$ is totally ordered by inclusion. Given M in $\mathcal{F}$, we define a closed subspace $M_-$ of X by

(1) $M_- = \operatorname{cl}[\bigcup\{L : L \in \mathcal{F},\ L \subsetneq M\}]$,

interpreting the right-hand side as {0} if there is no subspace L in $\mathcal{F}$ which is properly contained in M. Of course, $M_-$ depends on the chain $\mathcal{F}$ as well as on M. We describe $\mathcal{F}$ as a simple chain if it satisfies the following conditions:

(i) $\{0\}, X \in \mathcal{F}$;
(ii) if $\mathcal{F}_0$ is a subfamily of $\mathcal{F}$, then the closed subspaces $\bigcap\{L : L \in \mathcal{F}_0\}$ and $\operatorname{cl}[\bigcup\{L : L \in \mathcal{F}_0\}]$ are in $\mathcal{F}$;
(iii) for each M in $\mathcal{F}$, the quotient space $M/M_-$ is at most 1-dimensional.

Condition (ii) implies that $M_- \in \mathcal{F}$ for each M in $\mathcal{F}$. If M has an immediate predecessor L in $\mathcal{F}$, then $M_- = L$, but $M_- = M$ when M has no immediate predecessor. By a continuous chain we mean a simple chain $\mathcal{F}$ such that $M = M_-$ for each M in $\mathcal{F}$.


LEMMA 4.3.1. Each simple chain is maximal.

Proof. Suppose the contrary. Then there is a simple chain $\mathcal{F}$, and a chain $\mathcal{G}$ which properly contains $\mathcal{F}$. If N is a subspace in $\mathcal{G} \setminus \mathcal{F}$ then $N \ne \{0\}, X$ (since $\{0\}, X \in \mathcal{F}$). Let

$M = \bigcap\{L : L \in \mathcal{F},\ N \subseteq L\}$,
$M' = \operatorname{cl}[\bigcup\{L : L \in \mathcal{F},\ L \subseteq N\}]$.

Since $\mathcal{F}$ is simple, it follows that $M, M' \in \mathcal{F}$. It is clear that $M' \subseteq N \subseteq M$ and, since $N \notin \mathcal{F}$,

(2) $M' \subsetneq N \subsetneq M$.

We now show that $M' = M_-$. Since $M' \in \mathcal{F}$ and $M' \subsetneq M$, the inclusion $M' \subseteq M_-$ is an immediate consequence of (1). To prove the reverse inclusion it is sufficient, in view of (1), to show that M' contains each subspace L in $\mathcal{F}$ for which $L \subsetneq M$. Given such a subspace L, it follows from the definition of M that $N \not\subseteq L$. Since $L, N \in \mathcal{G}$ (which is totally ordered) we deduce that $L \subseteq N$, and so $L \subseteq M'$ by the definition of M'. This proves that $M' = M_-$, and (2) implies that the quotient space $M/M_-$ has dimension at least 2, contradicting our assumption that $\mathcal{F}$ is a simple chain.


LEMMA 4.3.2. Each maximal invariant chain is simple.

Proof. Suppose that $\mathcal{F}$ is a maximal invariant chain. Then $\{0\}, X \in \mathcal{F}$, since otherwise $\mathcal{F}$ could be enlarged by the addition of these subspaces. If $\mathcal{F}_0$ is a subfamily of $\mathcal{F}$, consider the closed subspace

$N = \bigcap\{L : L \in \mathcal{F}_0\}$

of X. Since each L in $\mathcal{F}_0$ is invariant under T, the same is true of N. Given any subspace M in $\mathcal{F}$, the total ordering of $\mathcal{F}$ implies that either (a) $M \subseteq L$ for each L in $\mathcal{F}_0$, and so $M \subseteq N$, or (b) $L \subseteq M$ for at least one L in $\mathcal{F}_0$, and so $N \subseteq M$. It follows that $\mathcal{F} \cup \{N\}$ is totally ordered by inclusion and is therefore an invariant chain; by maximality of $\mathcal{F}$, $N \in \mathcal{F}$. A similar argument shows that the closed subspace

$\operatorname{cl}[\bigcup\{L : L \in \mathcal{F}_0\}]$

is in $\mathcal{F}$.

So far, we have shown that $\mathcal{F}$ has the first two of the defining properties of simple chains; it remains to verify the third. Suppose that the quotient space $M/M_-$ has dimension greater than 1, for some M in $\mathcal{F}$. Since M and $M_-$ are invariant under the compact linear operator T, it follows from Theorem 1.8.3 (i) and (ii) that the mapping $T_0 : x + M_- \to Tx + M_-$ $(x \in M)$ is a compact linear operator acting on the Banach space $M/M_-$. By Theorem 4.2.1 there is a closed subspace $N_0$ of $M/M_-$ such that $\{0\} \ne N_0 \ne M/M_-$ and $T_0(N_0) \subseteq N_0$. It follows easily from these conditions that, if

$N = \{x \in M : x + M_- \in N_0\}$,

then N is a closed subspace of X and

(3) $M_- \subsetneq N \subsetneq M, \quad T(N) \subseteq N$.

Given any subspace L in $\mathcal{F}$, we have either (a) $M \subseteq L$, and so $N \subsetneq L$, or (b) $L \subsetneq M$, and so $L \subseteq M_- \subsetneq N$. Hence $N \notin \mathcal{F}$ and $\mathcal{F} \cup \{N\}$ is totally ordered by inclusion. This, together with (3), shows that $\mathcal{F} \cup \{N\}$ is an invariant chain which properly contains $\mathcal{F}$, contradicting the maximality assumption. It follows that, for each M in $\mathcal{F}$, the quotient space $M/M_-$ has dimension at most 1. This completes the proof that $\mathcal{F}$ is a simple chain.


LEMMA 4.3.3. A chain $\mathcal{F}$ is maximal if and only if it is simple. For an invariant chain $\mathcal{F}$, the following three conditions are equivalent:

(i) $\mathcal{F}$ is a maximal chain;

(ii) $\mathcal{F}$ is a maximal invariant chain;

(iii) $\mathcal{F}$ is simple.

Proof. Since a maximal chain is a maximal invariant chain for the compact linear operator 0, it follows from Lemma 4.3.2 that maximal chains are simple (of course, Theorem 4.2.1 is not needed to prove Lemma 4.3.2 in the case T = 0). This, together with Lemma 4.3.1, shows that a chain $\mathcal{F}$ is maximal if and only if it is simple.

Suppose now that $\mathcal{F}$ is an invariant chain for a compact linear operator T. If $\mathcal{F}$ is maximal in the class of all chains, it is a fortiori maximal in the class of invariant chains, so (i) implies (ii). It has already been noted, in the earlier lemmas, that (ii) implies (iii) and (iii) implies (i).


THEOREM 4.3.4. Let T be a compact linear operator acting on a complex Banach space X. Then there is a simple chain $\mathcal{F}$ of closed subspaces of X such that each L in $\mathcal{F}$ is invariant under T.

Proof. In view of Lemma 4.3.2 it is sufficient to take for $\mathcal{F}$ any maximal invariant chain.


Throughout the remainder of §4.3, we shall use the symbols T, X, $\mathcal{F}$ with the meanings attributed to them in Theorem 4.3.4. If $M \in \mathcal{F}$ then either $M = M_-$ or $M/M_-$ has dimension 1. In the latter case, suppose that

(4) $z_M \in M \setminus M_-$,

so that M is the linear span of $\{z_M\} \cup M_-$. Since M is invariant under T, $Tz_M \in M$; thus there exist a scalar $\alpha_M$ and a vector $y_M$ such that

(5) $Tz_M = \alpha_M z_M + y_M, \quad y_M \in M_-$.

It is easily verified that $\alpha_M$ does not depend on the particular choice of $z_M$ in $M \setminus M_-$. Since $M_-$ is invariant under $T - \alpha_M I$, and $(T - \alpha_M I)z_M \in M_-$, it follows that

(6) $(T - \alpha_M I)(M) \subseteq M_-$.

When $M \in \mathcal{F}$ and $M = M_-$, we define $\alpha_M = 0$. In this way we associate with each M in $\mathcal{F}$ a scalar $\alpha_M$ which we call the diagonal coefficient of T at M. In the finite-dimensional case, the elements $z_M$ $(M \in \mathcal{F},\ M \ne \{0\})$ form a basis, which inherits a total ordering from $\mathcal{F}$; the matrix of T with respect to this ordered basis is superdiagonal, and has diagonal entries $\alpha_M$ $(M \in \mathcal{F},\ M \ne \{0\})$.
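The finite-dimensional remark can be checked on a small example. The following is a minimal sketch, assuming the chain consists of the coordinate subspaces $M_m = \operatorname{span}(e_1, \dots, e_m)$ of a 3-dimensional space and that T is given by an upper triangular (superdiagonal) matrix; the helper names `mat_vec`, `in_coordinate_subspace`, `leaves_chain_invariant` and `diagonal_coefficients` are illustrative and do not come from the text.

```python
def mat_vec(T, x):
    """Apply the square matrix T (a list of rows) to the vector x."""
    return [sum(T[i][j] * x[j] for j in range(len(x))) for i in range(len(T))]


def in_coordinate_subspace(x, m):
    """True if x lies in M_m = span(e_1, ..., e_m), i.e. x[i] == 0 for i >= m."""
    return all(x[i] == 0 for i in range(m, len(x)))


def leaves_chain_invariant(T):
    """Check that T(M_m) is contained in M_m for m = 0, 1, ..., n."""
    n = len(T)
    for m in range(n + 1):
        for j in range(m):  # e_1, ..., e_m span M_m
            e_j = [1 if i == j else 0 for i in range(n)]
            if not in_coordinate_subspace(mat_vec(T, e_j), m):
                return False
    return True


def diagonal_coefficients(T):
    """With z_M = e_m, T e_m = T[m][m] e_m + (a vector in M_{m-1}),
    so the diagonal coefficient at M_m is the diagonal entry T[m][m]."""
    return [T[m][m] for m in range(len(T))]


# Hypothetical example: an upper triangular matrix and a non-example.
T_upper = [[2, 1, 5],
           [0, 3, -1],
           [0, 0, 2]]
T_lower = [[1, 0],
           [1, 1]]

print(leaves_chain_invariant(T_upper))   # the coordinate chain is invariant
print(diagonal_coefficients(T_upper))    # diagonal coefficients along the chain
print(leaves_chain_invariant(T_lower))   # a lower triangular matrix need not preserve it
```

In this example the scalar 2 occurs twice among the diagonal coefficients, which agrees with its algebraic multiplicity as an eigenvalue of the matrix.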

The diagonal multiplicity of a scalar $\alpha$ is defined to be the (possibly infinite) cardinal number of the set $\{M : M \in \mathcal{F},\ \alpha_M = \alpha\}$.

LEMMA 4.3.5. Suppose that K is a compact linear operator acting on a complex Banach space X, and $\delta$ is a positive real number. Then there is a finite subset S of the unit ball $X_1 = \{x \in X : \|x\| \le 1\}$ of X which has the following property: if $x \in X_1$, there exists y in S such that $\|Kx - Ky\| < \delta$.

Proof. Suppose that no such set S exists. We construct an infinite sequence $(x_n)$ of elements of $X_1$, inductively, as follows: $x_1$ is an arbitrary vector in $X_1$ and, when $x_1, \dots, x_{n-1}$ have been selected, we choose $x_n$ in $X_1$ so that $\|Kx_n - Kx_m\| \ge \delta$ for $1 \le m < n$. The existence of a suitable vector $x_n$ follows from the assumption that the finite subset $\{x_1, \dots, x_{n-1}\}$ of $X_1$ cannot have the property required for S in the lemma.

In this way we obtain a bounded infinite sequence $(x_n)$ of elements of X such that $\|Kx_n - Kx_m\| \ge \delta$ whenever $m \ne n$. Clearly $(Kx_n)$ has no convergent subsequence, contradicting our assumption that K is compact. This proves the existence of a set S with the stated property.
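The inductive selection used in this proof can be imitated numerically. The sketch below is only an illustration under strong assumptions: X is taken to be the plane with the Euclidean norm, K is a hypothetical 2 x 2 matrix, and the unit ball is replaced by a finite grid of sample points, so the code says nothing about the infinite-dimensional case. The names `greedy_net`, `unit_ball_grid`, `apply` and `dist` are illustrative.

```python
import math


def apply(K, x):
    """Apply the 2 x 2 matrix K to the vector x = (x0, x1)."""
    return (K[0][0] * x[0] + K[0][1] * x[1],
            K[1][0] * x[0] + K[1][1] * x[1])


def dist(u, v):
    """Euclidean distance between two points of the plane."""
    return math.hypot(u[0] - v[0], u[1] - v[1])


def greedy_net(K, candidates, delta):
    """Select points as in the proof: a candidate x is kept only if
    ||Kx - Ky|| >= delta for every y already selected."""
    S = []
    for x in candidates:
        Kx = apply(K, x)
        if all(dist(Kx, apply(K, y)) >= delta for y in S):
            S.append(x)
    return S


def unit_ball_grid(steps):
    """Finite sample of the closed unit disc (a stand-in for the unit ball)."""
    pts = []
    for i in range(-steps, steps + 1):
        for j in range(-steps, steps + 1):
            x = (i / steps, j / steps)
            if math.hypot(x[0], x[1]) <= 1.0:
                pts.append(x)
    return pts


K = [[1.0, 0.5], [0.0, 0.25]]   # hypothetical operator
pts = unit_ball_grid(10)
S = greedy_net(K, pts, 0.3)
print(len(pts), len(S))
```

Every rejected sample point was rejected because some selected y had $\|Kx - Ky\| < \delta$, so the selected set S has the property required in the lemma relative to the sample; the selection stops growing because the images of the selected points are $\delta$-separated inside a bounded set.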


LEMMA 4.3.6. Given a positive real number $\delta$ and a non-zero subspace M in $\mathcal{F}$, there exists L in $\mathcal{F}$ such that $L \subsetneq M$ and

$d(Tx, L) \le \delta\|x\|$ for each x in $M_-$.

Remark. The interest of this lemma lies in the case in which $M = M_-$; the result is trivial, with $L = M_-$, when $M \ne M_-$.

Proof. By Lemma 4.3.5, with K the restriction of T to $M_-$, there is a finite subset S of the unit ball of $M_-$ with the following property: if $x \in M_-$ and $\|x\| \le 1$, there exists y in S such that $\|Tx - Ty\| < \tfrac12\delta$. If $y \in S$, then $y \in M_-$ and so

$Ty \in M_- = \operatorname{cl}[\bigcup\{L : L \in \mathcal{F},\ L \subsetneq M\}]$;

so there is a subspace $L_y$ such that

$L_y \in \mathcal{F}, \quad L_y \subsetneq M, \quad d(Ty, L_y) < \tfrac12\delta \quad (y \in S)$.

Since S is finite and $\mathcal{F}$ is totally ordered by inclusion, the family $\{L_y : y \in S\}$ has a largest member L, and

$L \in \mathcal{F}, \quad L \subsetneq M, \quad d(Ty, L) < \tfrac12\delta \quad (y \in S)$.

If $x \in M_-$ and $\|x\| \le 1$, there exists y in S such that $\|Tx - Ty\| < \tfrac12\delta$; thus

$d(Tx, L) \le d(Tx - Ty, L) + d(Ty, L) \le \|Tx - Ty\| + d(Ty, L) < \delta$.

By homogeneity, $d(Tx, L) \le \delta\|x\|$ for each x in $M_-$.


LEMMA 4.3.7. Suppose that $\lambda$ is a non-zero eigenvalue of T, x is a non-zero vector satisfying $Tx = \lambda x$ and

$M = \bigcap\{L : L \in \mathcal{F},\ x \in L\}$.

Then $M \in \mathcal{F}$, $x \in M \setminus M_-$ and $\alpha_M = \lambda$.

Proof. Since $\mathcal{F}$ is a simple chain, $M \in \mathcal{F}$; clearly M is the smallest subspace in $\mathcal{F}$ which contains x, and $M \ne \{0\}$.

If $M = M_-$, let $\delta$ be a real number such that $0 < \delta < |\lambda|$, and choose L satisfying the conclusions of Lemma 4.3.6. Since $L \in \mathcal{F}$ and $L \subsetneq M$, it follows that $x \notin L$, so $d(x, L) > 0$. Furthermore, if $y \in L$ then $Ty \in L$ and thus

$|\lambda|\,d(x, L) = d(\lambda x, L) = d(Tx, L) = d(Tx + Ty, L) \le \delta\|x + y\|$;

the last inequality follows from Lemma 4.3.6, since $x + y \in M = M_-$. By taking lower bounds as y varies in L, we obtain $|\lambda|\,d(x, L) \le \delta\,d(x, L)$, contradicting our earlier assertions that $0 < \delta < |\lambda|$ and $d(x, L) > 0$. Thus $M \ne M_-$.

Since M is the smallest subspace in $\mathcal{F}$ which contains x, and $M_- \subsetneq M$, it follows that $x \notin M_-$. However, by (6),

$(\lambda - \alpha_M)x = (T - \alpha_M I)x \in M_-$,

so $\lambda - \alpha_M = 0$.

The lemma just proved shows that a non-zero eigenvalue of T is a diagonal coefficient. We now have a result in the opposite direction.


LEMMA 4.3.8. If $M \in \mathcal{F}$ and $\alpha_M \ne 0$, then $\alpha_M$ is an eigenvalue of T. If $\alpha_M$ has index 1 relative to T, there is a vector x in $M \setminus M_-$ satisfying $Tx = \alpha_M x$.

Proof. Let $T^M$ be the compact linear operator obtained by restricting T to M, and let $\mathcal{N}$ and $\mathcal{R}$ denote the null space and (closed) range space, respectively, of $T^M - \alpha_M I$. Since $\alpha_M \ne 0$, we have $M \ne M_-$; and, by (6),

$\mathcal{R} = (T^M - \alpha_M I)(M) = (T - \alpha_M I)(M) \subseteq M_- \ne M$.

It follows from Theorem 1.8.1 that $\alpha_M$ is an eigenvalue of $T^M$ (and hence, also, of T).

If $\alpha_M$ has index 1 relative to T, the operators $T - \alpha_M I$ and $(T - \alpha_M I)^2$ have the same null space; this property is preserved when we restrict to M, so $\alpha_M$ has index 1 as an eigenvalue of $T^M$. By Theorem 1.8.1,

$M = \mathcal{N} \oplus \mathcal{R}$.

Since $\mathcal{R} \subseteq M_-$, it follows that $\mathcal{N}$ meets $M \setminus M_-$; that is, there exists x in $M \setminus M_-$ such that $\alpha_M x = T^M x = Tx$.


LEMMA 4.3.9. If $\lambda$ is a non-zero diagonal coefficient of T, then its diagonal multiplicity is finite and is equal to its algebraic multiplicity as an eigenvalue of T.

Proof. Let d denote the diagonal multiplicity, m the algebraic multiplicity and $\nu$ the index of $\lambda$ relative to T. For each integer k $(\ge 0)$, let $\mathcal{N}_k$ be the null space of $(T - \lambda I)^k$. Then $\mathcal{N}_\nu$ is m-dimensional, and

$\{0\} = \mathcal{N}_0 \subsetneq \mathcal{N}_1 \subsetneq \dots \subsetneq \mathcal{N}_\nu = \mathcal{N}_{\nu+1} = \mathcal{N}_{\nu+2} = \dots$.

We show first that it is sufficient to consider the case in which $\nu = 1$. For this purpose we introduce the compact linear operator S defined by

$\mu I - S = (\lambda I - T)^\nu$,

where $\mu = \lambda^\nu$. Since $\mu I - S$ and $(\mu I - S)^2$ have the same null space $(= \mathcal{N}_\nu)$ of dimension m, it follows that $\mu$ is an eigenvalue of S with index 1 and algebraic multiplicity m. Since S is a polynomial in T, each subspace in the simple chain $\mathcal{F}$ is invariant under S, and we may consider the diagonal coefficients $\beta_M$ $(M \in \mathcal{F})$ of S. If $M \in \mathcal{F}$, $M \ne M_-$ and $z_M$, $y_M$ satisfy (4) and (5), then

$Sz_M = \mu z_M - (\mu I - S)z_M$
$= \mu z_M - (\lambda I - T)^\nu z_M$
$= \mu z_M - [(\lambda - \alpha_M)I + (\alpha_M I - T)]^\nu z_M$
$= [\mu - (\lambda - \alpha_M)^\nu]z_M - \sum_{j=1}^{\nu} \binom{\nu}{j} (\lambda - \alpha_M)^{\nu-j} (\alpha_M I - T)^j z_M$
$= [\mu - (\lambda - \alpha_M)^\nu]z_M + v_M$,

where, by (6), $v_M \in M_-$. Hence

(7) $\beta_M = \mu - (\lambda - \alpha_M)^\nu$

(the same equation is satisfied when $M \in \mathcal{F}$ and $M = M_-$, since $\alpha_M = \beta_M = 0$ in that case, and $\mu = \lambda^\nu$). It follows that $\beta_M = \mu$ if and only if $\alpha_M = \lambda$, so $\mu$ has diagonal multiplicity d as a diagonal coefficient of S. It is now sufficient to prove the lemma under the additional hypothesis that $\lambda$ has index 1 relative to T, since in the general case we can reduce to this situation by considering S, $\mu$ in place of T, $\lambda$ respectively.


Suppose, therefore, that $\nu = 1$. If d > m, there exist subspaces M(0), M(1), ..., M(m) in $\mathcal{F}$ such that

$M(0) \subsetneq M(1) \subsetneq \dots \subsetneq M(m)$

and $\alpha_{M(j)} = \lambda$ (j = 0, ..., m). Since $M(j-1) \subsetneq M(j)$, it follows that

$M(j-1) \subseteq M(j)_- \quad (j = 1, \dots, m)$.

By Lemma 4.3.8 there exist vectors $x_0, x_1, \dots, x_m$ such that $Tx_j = \lambda x_j$ and

(8) $x_j \in M(j) \setminus M(j)_- \quad (j = 0, 1, \dots, m)$.

The vectors $x_0, x_1, \dots, x_m$ lie in the (m-dimensional) null space of $T - \lambda I$, and are therefore linearly dependent. Hence some $x_i$ $(1 \le i \le m)$ is a linear combination of its predecessors and, for suitable scalars $\lambda_0, \lambda_1, \dots, \lambda_{i-1}$,

$x_i = \lambda_0 x_0 + \lambda_1 x_1 + \dots + \lambda_{i-1}x_{i-1} \in M(i-1) \subseteq M(i)_-$,

contradicting (8). Thus $d \le m$, and so d is finite.

There are just d distinct subspaces M in $\mathcal{F}$ such that $\alpha_M = \lambda$; let these be M(1), ..., M(d). Since $\alpha_{M(j)} \ne 0$, the quotient space $M(j)/M(j)_-$ has dimension 1, and there is a continuous linear functional $\varphi_j$ on M(j), with kernel $\varphi_j^{-1}(0)$ equal to $M(j)_-$. By the Hahn-Banach theorem, $\varphi_j$ extends to a continuous linear functional $\psi_j$ on X, and we have

(9) $M(j)_- = \{x \in M(j) : \psi_j(x) = 0\}$.

If m > d, there is a non-zero vector $x_0$ in the (m-dimensional) null space of $T - \lambda I$ which satisfies the d linear conditions $\psi_j(x_0) = 0$ (j = 1, ..., d). If $M = \bigcap\{L : L \in \mathcal{F},\ x_0 \in L\}$, then Lemma 4.3.7 asserts that $M \in \mathcal{F}$, $x_0 \in M \setminus M_-$ and $\alpha_M = \lambda$. The last condition implies that M = M(j) for some j $(1 \le j \le d)$, and we have

$x_0 \in M(j) \setminus M(j)_-, \quad \psi_j(x_0) = 0$,

contradicting (9). It follows that $m \le d$; since we have already proved the reverse inequality, m = d.


The following theorem summarises the main results obtained in 


the preceding lemmas. 


THEOREM 4.3.10. Suppose that T is a compact linear operator acting on a complex Banach space X, and $\mathcal{F}$ is a simple chain of closed subspaces of X, each of which is invariant under T. Then

(i) a non-zero scalar $\lambda$ is an eigenvalue of T if and only if it is a diagonal coefficient of T;

(ii) the diagonal multiplicity of $\lambda$ is equal to its algebraic multiplicity as an eigenvalue of T;

(iii) the operator T is quasi-nilpotent if and only if $T(M) \subseteq M_-$ for each M in $\mathcal{F}$.

Proof. The only statement not already proved is (iii). Now the spectrum of T consists of 0 (except, possibly, in the finite-dimensional case) and the non-zero eigenvalues of T. From part (i) of the theorem it follows that T is quasi-nilpotent if and only if $\alpha_M = 0$ whenever $M \in \mathcal{F}$. If $M \in \mathcal{F}$ and $M \ne M_-$, then (4), (5) and (6) imply that $\alpha_M = 0$ if and only if $T(M) \subseteq M_-$; when $M = M_-$, the conditions $\alpha_M = 0$ and $T(M) \subseteq M_-$ are both satisfied automatically. Hence T is quasi-nilpotent if and only if $T(M) \subseteq M_-$ for each M in $\mathcal{F}$.


COROLLARY 4.3.11. If T is a compact linear operator acting on a complex Banach space X, and there is a continuous chain $\mathcal{F}$ of closed subspaces of X, each of which is invariant under T, then T is quasi-nilpotent.

Proof. For each M in $\mathcal{F}$, $T(M) \subseteq M = M_-$, so the result follows from part (iii) of Theorem 4.3.10.


EXAMPLE 4.3.12. Suppose that X is the Hilbert space $L_2(0,1)$ (Lebesgue measure) and that, when $0 \le \lambda \le 1$, $L_\lambda$ is the closed subspace of X defined by

$L_\lambda = \{x \in L_2(0,1) : x(s) = 0 \text{ almost everywhere on } [0, \lambda]\}$.

Since $L_\lambda \subseteq L_\mu$ when $0 \le \mu \le \lambda \le 1$, the family $\mathcal{F} = \{L_\lambda : 0 \le \lambda \le 1\}$ is a chain of closed subspaces of X. It is clear that

(10) $L_0 = X, \quad L_1 = \{0\}$.

We assert also that, if S is a non-empty subset of [0,1] and $\alpha = \inf S$, $\beta = \sup S$, then

(11) $\bigcap\{L_\lambda : \lambda \in S\} = L_\beta, \quad \operatorname{cl}[\bigcup\{L_\lambda : \lambda \in S\}] = L_\alpha$.

As a special case of the second relation, we have

(12) $\operatorname{cl}[\bigcup\{L_\lambda : \alpha < \lambda \le 1\}] = L_\alpha \quad (0 \le \alpha < 1)$.

To prove the first equality in (11), let $(\lambda(n))$ be an increasing sequence of elements of S, with limit $\beta$. If

$x \in \bigcap\{L_\lambda : \lambda \in S\}$,

then $x \in \bigcap_n L_{\lambda(n)}$ and so x(s) = 0 for almost all s in $[0, \lambda(n)]$ (n = 1, 2, ...); thus x(s) = 0 for almost all s in $[0, \beta]$, and $x \in L_\beta$. This shows that

$\bigcap\{L_\lambda : \lambda \in S\} \subseteq L_\beta$,

and the reverse inclusion is apparent. To prove the second part of (11), suppose that $x \in L_\alpha$, and let $(\mu(n))$ be a decreasing sequence of elements of S, with limit $\alpha$. If $x_n$ is the element of $L_2(0,1)$ which vanishes on $[0, \mu(n)]$ and coincides with x on $[\mu(n), 1]$, then $x_n \in L_{\mu(n)}$ and, since x vanishes on $[0, \alpha]$,

$\|x - x_n\|^2 = \int_0^1 |x(s) - x_n(s)|^2\,ds = \int_\alpha^{\mu(n)} |x(s)|^2\,ds \to 0$

as $n \to \infty$. Thus

$x \in \operatorname{cl}[\bigcup\{L_{\mu(n)} : n = 1, 2, \dots\}] \subseteq \operatorname{cl}[\bigcup\{L_\lambda : \lambda \in S\}]$.

It follows that $L_\alpha \subseteq \operatorname{cl}[\bigcup\{L_\lambda : \lambda \in S\}]$; since the reverse inclusion is obvious, the proof of (11) is complete.

Since every subfamily of $\mathcal{F}$ has the form $\{L_\lambda : \lambda \in S\}$ for some subset S of [0,1], we can deduce from (10), (11) and (12) that $\mathcal{F}$ has the three defining properties of simple chains (with $M_- = M$ for each M in $\mathcal{F}$). Thus $\mathcal{F}$ is a continuous chain of closed subspaces of X.

Let h be a Lebesgue measurable complex-valued function on the square [0,1] × [0,1], and suppose that

$h(s, t) = 0 \quad (0 \le s < t \le 1)$,

$\int_0^1 ds \int_0^s |h(s, t)|^2\,dt < \infty$.

The associated integral operator $T_h$ acting on $L_2(0,1)$ is defined by

(13) $(T_h x)(s) = \int_0^s h(s, t)x(t)\,dt \quad (x \in L_2(0,1);\ 0 \le s \le 1)$.

If $0 \le \lambda \le 1$ and $x \in L_\lambda$, then the function x vanishes almost everywhere on $[0, \lambda]$; it follows from (13) that $T_h x$ has the same property, so $T_h x \in L_\lambda$. Thus $T_h$ leaves invariant each subspace in the continuous chain $\mathcal{F}$ and is therefore quasi-nilpotent, by Corollary 4.3.11. Operators of the type defined in (13), in which $(T_h x)(s)$ is an integral over [0, s], are known as Volterra integral operators.
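As a concrete illustration, take the simplest Volterra kernel, h(s, t) = 1 for $t \le s$, so that $(T_h x)(s) = \int_0^s x(t)\,dt$. The sketch below (an assumption-laden illustration, not part of the text's argument) applies this operator repeatedly to the constant function 1, using exact rational arithmetic on polynomial coefficients, and prints $\|T_h^n 1\|^{1/n}$ in the $L_2(0,1)$ norm. The function names `volterra`, `iterate`, `l2_norm_sq` and `rate` are hypothetical. The computation follows a single vector only, so it is consistent with quasi-nilpotence but does not prove it.

```python
from fractions import Fraction


def volterra(coeffs):
    """Map the polynomial sum c_k s^k to its integral from 0 to s,
    i.e. to sum c_k s^(k+1) / (k+1). Coefficients are listed from degree 0."""
    return [Fraction(0)] + [Fraction(c) / (k + 1) for k, c in enumerate(coeffs)]


def iterate(coeffs, n):
    """Apply the Volterra operator n times."""
    for _ in range(n):
        coeffs = volterra(coeffs)
    return coeffs


def l2_norm_sq(coeffs):
    """Exact squared L_2(0,1) norm of a real polynomial:
    the integral of s^(j+k) over [0,1] is 1/(j+k+1)."""
    return sum(cj * ck / (j + k + 1)
               for j, cj in enumerate(coeffs)
               for k, ck in enumerate(coeffs))


def rate(n):
    """||T^n 1||^(1/n), computed from the exact squared norm."""
    return float(l2_norm_sq(iterate([Fraction(1)], n))) ** (1.0 / (2 * n))


for n in range(1, 9):
    print(n, rate(n))
```

Here $T_h^n 1 = s^n / n!$, and the printed values decrease steadily with n.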

It follows easily from the preceding discussion that, if $\mathcal{H}$ is a separable infinite-dimensional Hilbert space, then there is a continuous chain of closed subspaces of $\mathcal{H}$. To prove this, note first that $L_2(0,1)$ is infinite-dimensional, and is separable since polynomials, in which each coefficient has rational real and imaginary parts, form a countable everywhere dense subset. Thus $\mathcal{H}$ and $L_2(0,1)$ are both isometrically isomorphic to $l_2(N)$, where N is the set of positive integers. It follows that there is an isometric isomorphism U from $L_2(0,1)$ onto $\mathcal{H}$; and it is apparent that, if $\mathcal{F}$ is defined as above, then the family $\{U(L) : L \in \mathcal{F}\}$ is a continuous chain of subspaces of $\mathcal{H}$.


4.4. Superdiagonal integrals

This section is concerned with the superdiagonal representation of a compact linear operator acting on a Hilbert space. In this context, the theory developed in §4.3 can be strengthened considerably by use of the one-to-one order preserving correspondence between closed subspaces and projections (see §1.7, especially Lemma 1.7.10 and Theorem 1.7.11). Our main interest is in the following question: given a compact self-adjoint operator K acting on a Hilbert space $\mathcal{H}$, and a simple chain $\mathcal{F}$ of closed subspaces of $\mathcal{H}$, in what circumstances is there a quasi-nilpotent operator T which has skew-adjoint part K and leaves invariant each subspace in $\mathcal{F}$? Before considering this problem, we shall express some of the results of §4.3 in a slightly different form.

The projection from a Hilbert space $\mathcal{H}$ onto a closed subspace M will be denoted by E(M). If $T \in \mathcal{B}(\mathcal{H})$ and M is invariant under T then, given any x in $\mathcal{H}$, we have $TE(M)x \in T(M) \subseteq M$ and therefore $TE(M)x = E(M)TE(M)x$; thus $TE(M) = E(M)TE(M)$. Conversely, if $T \in \mathcal{B}(\mathcal{H})$ and $TE(M) = E(M)TE(M)$ then, for each x in M,

$Tx = TE(M)x = E(M)TE(M)x \in M$;

and thus $T(M) \subseteq M$. It follows that a closed subspace M of $\mathcal{H}$ is invariant under T if and only if $TE(M) = E(M)TE(M)$.
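The criterion TE(M) = E(M)TE(M) is easy to test for matrices. The following minimal sketch assumes a finite-dimensional space with M spanned by the first m coordinate vectors, so that E(M) is a diagonal 0/1 matrix; the names `matmul`, `coordinate_projection` and `is_invariant` are illustrative only.

```python
def matmul(A, B):
    """Product of two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def coordinate_projection(n, m):
    """Orthogonal projection onto span(e_1, ..., e_m) in n dimensions."""
    return [[1 if (i == j and i < m) else 0 for j in range(n)] for i in range(n)]


def is_invariant(T, E):
    """M = range(E) is invariant under T exactly when TE = ETE."""
    TE = matmul(T, E)
    return TE == matmul(E, TE)


T_upper = [[2, 1, 5],
           [0, 3, -1],
           [0, 0, 2]]
T_shift = [[0, 0],
           [1, 0]]   # sends e_1 to e_2, so span(e_1) is not invariant

print([is_invariant(T_upper, coordinate_projection(3, m)) for m in range(4)])
print(is_invariant(T_shift, coordinate_projection(2, 1)))
```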

Suppose now that T is a compact linear operator acting on a Hilbert space $\mathcal{H}$, and $\mathcal{F}$ is a simple chain of closed subspaces of $\mathcal{H}$, each of which is invariant under T. Then

(1) $TE(M) = E(M)TE(M) \quad (M \in \mathcal{F})$

and, since $\mathcal{F}$ is totally ordered by inclusion,

(2) $E(L)E(M) = E(M)E(L) = E(L \cap M) \quad (L, M \in \mathcal{F})$.

Note also that, if $L, M \in \mathcal{F}$ and $L \subseteq M$, then $E(M) - E(L) = E(M \cap L^\perp)$. Equation 4.3(1) implies that

(3) $E(M_-) = \bigvee\{E(L) : L \in \mathcal{F},\ L \subsetneq M\} \quad (M \in \mathcal{F})$.

If $M \in \mathcal{F}$ and $M \ne M_-$, the element $z_M$ of 4.3(4) can be chosen so that

(4) $z_M \in M \cap (M_-)^\perp, \quad \|z_M\| = 1 \quad (M \in \mathcal{F},\ M \ne M_-)$.

It then follows from 4.3(5) that

$\langle Tz_M, z_M \rangle = \alpha_M \langle z_M, z_M \rangle + \langle y_M, z_M \rangle = \alpha_M$,

since $y_M \in M_-$; so the diagonal coefficient $\alpha_M$ of T at M is given by

(5) $\alpha_M = \langle Tz_M, z_M \rangle \quad (M \in \mathcal{F},\ M \ne M_-)$.

For each M in $\mathcal{F}$, the projection $E(M) - E(M_-)$ will be denoted by P(M). If $M \ne M_-$, then M is the linear span of $z_M$ and $M_-$ and so, for each x in $\mathcal{H}$, there exist a scalar a and a vector y in $M_-$ such that $E(M)x = az_M + y$. It follows that

$E(M_-)x = E(M_-)E(M)x = y$,
$P(M)x = E(M)x - E(M_-)x = az_M$.

Since $y \in M_-$ and $z_M \in (M_-)^\perp$,

$a = \langle az_M + y, z_M \rangle = \langle E(M)x, z_M \rangle = \langle x, E(M)z_M \rangle = \langle x, z_M \rangle$.

Thus

(6) $P(M) = E(M) - E(M_-), \quad P(M)x = \langle x, z_M \rangle z_M$

whenever $M \in \mathcal{F}$, $M \ne M_-$ and $x \in \mathcal{H}$. A straightforward computation, based on (6) and (5), now gives

(7) $P(M)TP(M) = \alpha_M P(M) \quad (M \in \mathcal{F},\ M \ne M_-)$.

The following result, which is a variant of Theorem 4.3.10(iii), is an immediate consequence of part (i) of that theorem, together with (7).


LEMMA 4.4.1. Suppose that T is a compact linear operator acting on a Hilbert space $\mathcal{H}$ and $\mathcal{F}$ is a simple chain of closed subspaces of $\mathcal{H}$, each of which is invariant under T. Then T is quasi-nilpotent if and only if P(M)TP(M) = 0 whenever $M \in \mathcal{F}$ and $M \ne M_-$.


We now turn to the problem of main interest in this section. Our 


first result is superficial, but is sometimes useful. 


THEOREM 4.4.2. Suppose that $\mathcal{F}$ is a simple chain of closed subspaces of a Hilbert space $\mathcal{H}$, and K is a compact self-adjoint operator acting on $\mathcal{H}$. A necessary condition for the existence of a quasi-nilpotent operator T which has skew-adjoint part K and leaves invariant each subspace in $\mathcal{F}$ is

(8) $P(M)KP(M) = 0 \quad (M \in \mathcal{F},\ M \ne M_-)$.

If such an operator T exists, it is unique, and is a compact linear operator.


Proof. Suppose that there is an operator T with the stated properties. Since Im(T) (= K) is compact, it follows from Theorem 1.8.8 that T is compact, and Lemma 4.4.1 asserts that P(M)TP(M) = 0 whenever $M \in \mathcal{F}$ and $M \ne M_-$. By taking adjoints, we obtain P(M)T*P(M) = 0, so

$P(M)KP(M) = \tfrac12 i\,P(M)(T^* - T)P(M) = 0 \quad (M \in \mathcal{F},\ M \ne M_-)$.

If $T_1$ and $T_2$ are two such operators, the above reasoning shows that both are compact and satisfy $P(M)T_jP(M) = 0$ $(M \in \mathcal{F},\ M \ne M_-)$. Thus $T_1 - T_2$ is compact, leaves invariant each subspace in $\mathcal{F}$ and satisfies $P(M)(T_1 - T_2)P(M) = 0$ $(M \in \mathcal{F},\ M \ne M_-)$. By Lemma 4.4.1, $T_1 - T_2$ is quasi-nilpotent; it is also self-adjoint, since

$\operatorname{Im}(T_1 - T_2) = \operatorname{Im} T_1 - \operatorname{Im} T_2 = K - K = 0$,

so it follows from Corollary 1.7.5 that $T_1 - T_2 = 0$.

Our next objective is to find an explicit formula for T, in terms of $\mathcal{F}$ and K, when T, $\mathcal{F}$, K are related as in Theorem 4.4.2. Before this can be done, we need some auxiliary results.


LEMMA 4.4.3. Suppose that $\mathcal{F}$ is a simple chain of closed subspaces of a Hilbert space $\mathcal{H}$, K is a compact linear operator acting on $\mathcal{H}$, P(M)KP(M) = 0 whenever $M \in \mathcal{F}$ and $M \ne M_-$, and $\delta$ is a positive real number.

(i) If $M \in \mathcal{F}$ and $M \ne \{0\}$, there exists L in $\mathcal{F}$ such that $L \subsetneq M$ and
$\|[E(M) - E(L)]K[E(M) - E(L)]\| \le \delta$.

(ii) If $M \in \mathcal{F}$ and $M \ne \mathcal{H}$, there exists N in $\mathcal{F}$ such that $M \subsetneq N$ and
$\|[E(N) - E(M)]K[E(N) - E(M)]\| \le \delta$.


Proof. By Lemma 4.3.5 there is a finite subset S of the unit ball $\mathcal{H}_1 = \{x \in \mathcal{H} : \|x\| \le 1\}$ of $\mathcal{H}$ which has the following property: given any x in $\mathcal{H}_1$, there exists y in S such that $\|Kx - Ky\| < \tfrac12\delta$. The set

$U = \{A \in \mathcal{B}(\mathcal{H}) : \|AKy - E(M)Ky\| < \tfrac12\delta \text{ for each } y \text{ in } S\}$

is a neighbourhood of E(M) in the strong operator topology.

We now prove (i). If $M \ne M_-$, then

$[E(M) - E(M_-)]K[E(M) - E(M_-)] = P(M)KP(M) = 0$,

and it suffices to take $L = M_-$. Now suppose that $M = M_-$, so that

$E(M) = E(M_-) = \bigvee\{E(L) : L \in \mathcal{F},\ L \subsetneq M\}$.

By Theorem 1.7.11, the set $\{E(L) : L \in \mathcal{F},\ L \subsetneq M\}$ has E(M) in its strong closure, and therefore meets the strong neighbourhood U of E(M). Hence there is a subspace L satisfying

(9) $L \in \mathcal{F}, \quad L \subsetneq M, \quad \|E(L)Ky - E(M)Ky\| < \tfrac12\delta \quad (y \in S)$.

Given any x in $\mathcal{H}_1$, there exists y in S such that $\|Kx - Ky\| < \tfrac12\delta$. This, with (9), gives

$\|[E(M) - E(L)]Kx\| \le \|[E(M) - E(L)](Kx - Ky)\| + \|[E(M) - E(L)]Ky\| < \|Kx - Ky\| + \tfrac12\delta < \delta$;

so $\|[E(M) - E(L)]Kx\| < \delta$ whenever $x \in \mathcal{H}_1$. Thus

$\|[E(M) - E(L)]K\| \le \delta$,

and this implies the inequality required in (i).

To prove (ii), we consider the subspace

(10) $L = \bigcap\{N : N \in \mathcal{F},\ M \subsetneq N\}$.

Then $M \subseteq L$ and, since $\mathcal{F}$ is a simple chain, $L \in \mathcal{F}$; clearly there is no subspace N in $\mathcal{F}$ such that $M \subsetneq N \subsetneq L$. If $L \ne M$, then M is the immediate predecessor of L in $\mathcal{F}$, so $M = L_-$; in this case

$[E(L) - E(M)]K[E(L) - E(M)] = P(L)KP(L) = 0$,

and it suffices to take N = L. Now suppose that L = M, so that

$E(M) = E(L) = \bigwedge\{E(N) : N \in \mathcal{F},\ M \subsetneq N\}$.

By Theorem 1.7.11, the set $\{E(N) : N \in \mathcal{F},\ M \subsetneq N\}$ has E(M) in its strong closure, and therefore meets the strong neighbourhood U of E(M). Hence there is a subspace N satisfying

$N \in \mathcal{F}, \quad M \subsetneq N, \quad \|E(N)Ky - E(M)Ky\| < \tfrac12\delta \quad (y \in S)$.

An argument, similar to the one used above to prove part (i) of the lemma, now shows that

$\|[E(N) - E(M)]K\| \le \delta$,

and this implies the inequality required in (ii).
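Suppose that $\mathcal{F}$ is a simple chain of closed subspaces of a
Hilbert space $\mathcal{H}$. By a partition of $\mathcal{F}$ we mean a finite subset $\mathcal{P}$ of $\mathcal{F}$
which contains the two trivial subspaces $\{0\}$ and $\mathcal{H}$. We may suppose
that $\mathcal{P} = \{M_0, M_1, \ldots, M_n\}$, where

$$\{0\} = M_0 \subsetneq M_1 \subsetneq \cdots \subsetneq M_n = \mathcal{H}.$$

If $K \in \mathcal{B}(\mathcal{H})$, the diagonal part of $K$ relative to $\mathcal{P}$ is a bounded linear
operator, denoted by $D(K, \mathcal{P})$ and defined by

$$D(K, \mathcal{P}) = \sum_{j=1}^{n} [E(M_j)-E(M_{j-1})]K[E(M_j)-E(M_{j-1})]. \tag{11}$$

If we write $E_j$ for $E(M_j)-E(M_{j-1})$, then $E_1, \ldots, E_n$ are pairwise
orthogonal projections with sum $I$. For each $x$ in $\mathcal{H}$,

$$\begin{aligned}
\|D(K,\mathcal{P})x\|^2 &= \Bigl\|\sum_{j=1}^{n} E_jKE_jx\Bigr\|^2 = \sum_{j=1}^{n} \|E_jKE_jx\|^2 \\
&\le \sum_{j=1}^{n} \|E_jKE_j\|^2\,\|E_jx\|^2 \\
&\le \Bigl[\max_{1\le j\le n}\|E_jKE_j\|\Bigr]^2 \sum_{j=1}^{n}\|E_jx\|^2 \\
&= \Bigl[\max_{1\le j\le n}\|E_jKE_j\|\Bigr]^2 \|x\|^2.
\end{aligned}$$

Thus

$$\|D(K,\mathcal{P})\| \le \max_{1\le j\le n}\|E_jKE_j\| \le \|K\|. \tag{12}$$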
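Now suppose that $\mathcal{P}_0$ is another partition of $\mathcal{F}$, consisting of the
subspaces $\{0\} = N_0 \subsetneq N_1 \subsetneq \cdots \subsetneq N_m = \mathcal{H}$, and write $F_k$ for
$E(N_k)-E(N_{k-1})$ $(k = 1, \ldots, m)$. If $\mathcal{P} \subseteq \mathcal{P}_0$, then, for each
$k = 1, 2, \ldots, m$, there is an integer $r$ such that $1 \le r \le n$ and
$M_{r-1} \subseteq N_{k-1} \subsetneq N_k \subseteq M_r$. Thus $F_k \le E_r$, and so $F_kE_j = E_jF_k = 0$
when $1 \le j \le n$ and $j \ne r$. It follows that

$$F_kKF_k = F_kE_rKE_rF_k = F_k\Bigl(\sum_{j=1}^{n} E_jKE_j\Bigr)F_k = F_kD(K,\mathcal{P})F_k,$$

and summation over $k = 1, \ldots, m$ yields

$$D(K, \mathcal{P}_0) = D(D(K, \mathcal{P}), \mathcal{P}_0) \quad \text{if } \mathcal{P} \subseteq \mathcal{P}_0.$$

This, together with (12), implies that

$$\|D(K, \mathcal{P}_0)\| \le \|D(K, \mathcal{P})\| \quad \text{if } \mathcal{P} \subseteq \mathcal{P}_0. \tag{13}$$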


LEMMA 4.4.4. Suppose that $\mathcal{F}$ is a simple chain of closed
subspaces of a Hilbert space $\mathcal{H}$, $K$ is a compact linear operator
acting on $\mathcal{H}$, $P(M)KP(M) = 0$ whenever $M \in \mathcal{F}$ and $M \ne M_-$, and $\delta$ is a
positive real number. Then there is a partition $\mathcal{P}$ of $\mathcal{F}$ such that
$\|D(K, \mathcal{P})\| < \delta$.

Proof. We shall say that a subspace $M$ in $\mathcal{F}$ is satisfactory if
there exist $M_0, M_1, \ldots, M_r$ in $\mathcal{F}$ such that

$$\{0\} = M_0 \subsetneq M_1 \subsetneq \cdots \subsetneq M_r = M, \tag{14}$$

$$\|[E(M_j)-E(M_{j-1})]K[E(M_j)-E(M_{j-1})]\| < \delta \quad (j = 1, \ldots, r). \tag{15}$$

We have to show that $\mathcal{H}$ is satisfactory; for then, the subspaces
$M_0, \ldots, M_r$ satisfying (14) and (15) form a partition $\mathcal{P}$ of $\mathcal{F}$, and
$\|D(K, \mathcal{P})\| < \delta$ by (12). The proof that $\mathcal{H}$ is satisfactory is divided
into three stages.
(a) There exists $N$ in $\mathcal{F}$ such that $N \ne \{0\}$ and $N$ is satisfactory.
By Lemma 4.4.3 (ii), with $M = \{0\}$, there exists a non-zero subspace
$N$ in $\mathcal{F}$ such that $\|E(N)KE(N)\| < \delta$. By taking $M_0 = \{0\}$ and $M_1 = N$,
it follows that $N$ is satisfactory.

We now define a subspace $M$ in $\mathcal{F}$ by

$$M = \bigvee\{N : N \in \mathcal{F},\ N \text{ is satisfactory}\}. \tag{16}$$

(b) $M$ is satisfactory. It follows from part (a) of the proof that
$M \ne \{0\}$, so Lemma 4.4.3 (i) implies the existence of a subspace $L$
such that

$$L \in \mathcal{F}, \quad L \subsetneq M, \quad \|[E(M)-E(L)]K[E(M)-E(L)]\| < \delta. \tag{17}$$

Since $L$ is a proper subspace of $M$, we deduce from (16) that there
is a satisfactory subspace $N$ in $\mathcal{F}$ such that $L \subseteq N \subseteq M$. Hence there
exist subspaces $M_0, \ldots, M_r$ in $\mathcal{F}$ such that

$$\{0\} = M_0 \subsetneq M_1 \subsetneq \cdots \subsetneq M_r = N,$$

and (15) is satisfied. Since $L \subseteq N \subseteq M$, $E(M)-E(N)$ is a subprojection
of $E(M)-E(L)$ and so, by (17),

$$\|[E(M)-E(N)]K[E(M)-E(N)]\| < \delta.$$

By considering the subspaces $M_0, M_1, \ldots, M_r, M_{r+1}$, where $M_{r+1} = M$,
it now follows that $M$ is satisfactory.

(c) $\mathcal{H}$ is satisfactory. Suppose the contrary so that, by part (b)
of the proof, $M \ne \mathcal{H}$. Since $M$ is satisfactory, there exist subspaces
$M_0, M_1, \ldots, M_r$ in $\mathcal{F}$ such that (14) and (15) are satisfied. By Lemma
4.4.3 (ii) there is a subspace $M_{r+1}$ in $\mathcal{F}$ such that

$$M = M_r \subsetneq M_{r+1}, \quad \|[E(M_{r+1})-E(M_r)]K[E(M_{r+1})-E(M_r)]\| < \delta.$$

It is clear that $M_{r+1}$ is satisfactory, and this contradicts (16) since
$M \subsetneq M_{r+1}$. Hence $\mathcal{H}$ is satisfactory, and the lemma is proved.


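Suppose that $\mathcal{F}$ is a simple chain of closed subspaces of a
Hilbert space $\mathcal{H}$, and $\mathcal{P}$ is a partition of $\mathcal{F}$, where

$$\mathcal{P} = \{M_0, M_1, \ldots, M_n\}, \qquad \{0\} = M_0 \subsetneq M_1 \subsetneq \cdots \subsetneq M_n = \mathcal{H}. \tag{18}$$

If $K \in \mathcal{B}(\mathcal{H})$, the bounded linear operator $S(K, \mathcal{P})$ defined by

$$S(K, \mathcal{P}) = \sum_{j=1}^{n} E(M_{j-1})K[E(M_j)-E(M_{j-1})] \tag{19}$$

is called the superdiagonal part of $K$ relative to $\mathcal{P}$. Clearly
$S(K, \mathcal{P})(M_j) \subseteq M_{j-1}$ $(j = 1, \ldots, n)$. Given any non-zero subspace $M$
in $\mathcal{F}$, there is an integer $j$ such that $M_{j-1} \subsetneq M \subseteq M_j$; thus
$M_{j-1} \subseteq M_- \subseteq M \subseteq M_j$, and

$$S(K, \mathcal{P})(M) \subseteq S(K, \mathcal{P})(M_j) \subseteq M_{j-1} \subseteq M_-.$$

It follows that

$$S(K, \mathcal{P})(M) \subseteq M_- \quad (M \in \mathcal{F}). \tag{20}$$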


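If we write $E_j$ for $E(M_j)-E(M_{j-1})$, then the adjoint $S(K, \mathcal{P})^*$ is given
by

$$\begin{aligned}
S(K, \mathcal{P})^* &= \sum_{j=1}^{n} [E(M_j)-E(M_{j-1})]K^*E(M_{j-1}) \\
&= \sum_{j=2}^{n} E_jK^*\Bigl(\sum_{k=1}^{j-1} E_k\Bigr) \\
&= \sum_{k=1}^{n-1} \Bigl(\sum_{j=k+1}^{n} E_j\Bigr)K^*E_k \\
&= \sum_{k=1}^{n-1} [I-E(M_k)]K^*[E(M_k)-E(M_{k-1})] \\
&= \sum_{j=1}^{n} [I-E(M_j)]K^*[E(M_j)-E(M_{j-1})].
\end{aligned}$$

If $K$ is self-adjoint then

$$\begin{aligned}
S(K, \mathcal{P})+S(K, \mathcal{P})^* &= \sum_{j=1}^{n} [I-E(M_j)+E(M_{j-1})]K[E(M_j)-E(M_{j-1})] \\
&= K - \sum_{j=1}^{n} [E(M_j)-E(M_{j-1})]K[E(M_j)-E(M_{j-1})] \\
&= K-D(K, \mathcal{P}),
\end{aligned}$$

where $D(K, \mathcal{P})$ is the diagonal part of $K$ relative to $\mathcal{P}$. Thus

$$K = S(K, \mathcal{P})+S(K, \mathcal{P})^*+D(K, \mathcal{P}) \tag{21}$$

if $K = K^* \in \mathcal{B}(\mathcal{H})$.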
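The decomposition (21) can be checked by hand in a finite-dimensional model. The sketch below is only an illustration under assumptions that are not part of the text: the Hilbert space is taken to be $\mathbb{R}^4$, the chain consists of the coordinate subspaces spanned by the first $k$ basis vectors, a partition is encoded by its list of cut points, and the helper names (`block_index`, `diagonal_part`, `superdiagonal_part`) are hypothetical. In this model $D(K,\mathcal{P})$ is the block-diagonal part of the matrix and $S(K,\mathcal{P})$ is its strictly block-upper-triangular part.

```python
# Finite-dimensional sketch (assumption: H = R^4 with the coordinate chain
# M_k = span(e_1, ..., e_k)); a partition is a list of cut points.

def block_index(cuts, a):
    """Return j with cuts[j-1] <= a < cuts[j] for a 0-based coordinate a."""
    for j in range(1, len(cuts)):
        if a < cuts[j]:
            return j
    raise ValueError("coordinate outside the space")

def diagonal_part(K, cuts):
    """D(K, P): keep entries whose row and column lie in the same block."""
    n = len(K)
    return [[K[a][b] if block_index(cuts, a) == block_index(cuts, b) else 0
             for b in range(n)] for a in range(n)]

def superdiagonal_part(K, cuts):
    """S(K, P): keep entries whose row block strictly precedes the column block."""
    n = len(K)
    return [[K[a][b] if block_index(cuts, a) < block_index(cuts, b) else 0
             for b in range(n)] for a in range(n)]

def transpose(A):
    return [list(row) for row in zip(*A)]

def add(*mats):
    n = len(mats[0])
    return [[sum(m[a][b] for m in mats) for b in range(n)] for a in range(n)]

K = [[2, 1, 0, 3],
     [1, 5, 4, 0],
     [0, 4, 1, 2],
     [3, 0, 2, 7]]        # real symmetric, so K = K*
cuts = [0, 1, 3, 4]       # the partition {0}, M_1, M_3, H

S = superdiagonal_part(K, cuts)
D = diagonal_part(K, cuts)
reconstructed = add(S, transpose(S), D)   # S + S* + D
print(reconstructed == K)
```

For a real symmetric matrix the adjoint is the transpose, so the three pieces occupy disjoint sets of matrix entries; this is the finite-dimensional shadow of the orthogonality used later in the Schmidt class.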


If $\mathcal{Q}$ is a collection of subspaces satisfying

$$\mathcal{Q} = \{L_1, \ldots, L_n\}, \qquad L_j \in \mathcal{F}, \qquad M_{j-1} \subseteq L_j \subseteq M_j \quad (j = 1, \ldots, n), \tag{22}$$

then the 'Riemann-Stieltjes sum'

$$S(K, \mathcal{P}, \mathcal{Q}) = \sum_{j=1}^{n} E(L_j)K[E(M_j)-E(M_{j-1})] \tag{23}$$

is a bounded linear operator on $\mathcal{H}$; the superdiagonal part $S(K, \mathcal{P})$ is
an example of such a sum (obtained by taking $L_j = M_{j-1}$). We say
that the integral

$$\int_{\mathcal{F}} E(M)KE(dM)$$

converges if there is a bounded linear operator $A$ on $\mathcal{H}$ with the
following property: given any positive real number $\delta$, there is a
partition $\mathcal{P}(\delta)$ of $\mathcal{F}$ such that

$$\|A-S(K, \mathcal{P}, \mathcal{Q})\| < \delta$$

whenever the partition $\mathcal{P}$ of $\mathcal{F}$ contains $\mathcal{P}(\delta)$ (and, of course, $\mathcal{P}$ and $\mathcal{Q}$
are related as in (18) and (22)). When this condition is satisfied,
we write

$$A = \int_{\mathcal{F}} E(M)KE(dM), \tag{24}$$

and call $A$ the superdiagonal integral of $K$ relative to $\mathcal{F}$. By taking
$\mathcal{Q} = \{M_0, \ldots, M_{n-1}\}$, it follows that $\|A-S(K, \mathcal{P})\| < \delta$ whenever $\mathcal{P}$
contains $\mathcal{P}(\delta)$; so $A$ is a norm limit of operators of the form $S(K, \mathcal{P})$
and, by (20),

$$A(M) \subseteq M_- \quad (M \in \mathcal{F}). \tag{25}$$


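LEMMA 4.4.5. If $\mathcal{F}$ is a simple chain of closed subspaces of a
Hilbert space $\mathcal{H}$, $K \in \mathcal{B}(\mathcal{H})$ and the superdiagonal integral
$\int_{\mathcal{F}} E(M)KE(dM)$ converges, then $P(M)KP(M) = 0$ $(M \in \mathcal{F},\ M \ne M_-)$.

Proof. Given any positive real number $\delta$, let $\mathcal{P}(\delta)$ be a partition
of $\mathcal{F}$ such that $\|A-S(K, \mathcal{P}, \mathcal{Q})\| < \delta$ whenever $\mathcal{P}$ contains $\mathcal{P}(\delta)$, where
$A$ is the superdiagonal integral. If $M \in \mathcal{F}$ and $M \ne M_-$, let
$\mathcal{P} = \{M_0, M_1, \ldots, M_n\}$ be the partition consisting of the members of
$\mathcal{P}(\delta)$, together with $M$ and $M_-$. Since $M_-$ is the immediate
predecessor of $M$ in $\mathcal{F}$, there is an integer $k$ such that $M_{k-1} = M_-$,
$M_k = M$. We now consider the Riemann-Stieltjes sums $S(K, \mathcal{P}, \mathcal{Q}_1)$
and $S(K, \mathcal{P}, \mathcal{Q}_2)$, where $\mathcal{Q}_1 = \{M_0, \ldots, M_{n-1}\}$ and $\mathcal{Q}_2$ is obtained from
$\mathcal{Q}_1$ by altering the $k$th element to $M_k$. Since
$\|A-S(K, \mathcal{P}, \mathcal{Q}_j)\| < \delta$ $(j = 1, 2)$ and

$$\begin{aligned}
S(K, \mathcal{P}, \mathcal{Q}_2)-S(K, \mathcal{P}, \mathcal{Q}_1) &= [E(M_k)-E(M_{k-1})]K[E(M_k)-E(M_{k-1})] \\
&= P(M)KP(M),
\end{aligned}$$

it follows that $\|P(M)KP(M)\| < 2\delta$. This has been proved for every
positive $\delta$, so $P(M)KP(M) = 0$.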


We can now prove the main result of this section. 


THEOREM 4.4.6. Suppose that $\mathcal{F}$ is a simple chain of closed
subspaces of a Hilbert space $\mathcal{H}$, and $K$ is a compact self-adjoint
operator acting on $\mathcal{H}$. In order that there exist a quasi-nilpotent
operator $T$ which has skew-adjoint part $K$ and leaves invariant each
subspace in $\mathcal{F}$ it is necessary and sufficient that the superdiagonal
integral of $K$ relative to $\mathcal{F}$ should converge. When this is so, the
unique operator $T$ with the stated properties is given by

$$T = 2i\int_{\mathcal{F}} E(M)KE(dM).$$

Proof. First, suppose that the superdiagonal integral converges,
and denote its value by $A$. Then $A$ is the limit, in norm, of operators
of the form $S(K, \mathcal{P})$ defined by (19). Since each $S(K, \mathcal{P})$ is compact, so
is $A$; and, by (25), $A(M) \subseteq M_-$ whenever $M \in \mathcal{F}$. It now follows from
Theorem 4.3.10 that $A$ is quasi-nilpotent.

Since the superdiagonal integral converges, Lemma 4.4.5 implies
that $P(M)KP(M) = 0$ whenever $M \in \mathcal{F}$ and $M \ne M_-$. If $\delta$ is a positive
real number then, by Lemma 4.4.4, there is a partition $\mathcal{P}_1$ of $\mathcal{F}$ such
that $\|D(K, \mathcal{P}_1)\| < \delta$; it follows from (13) that

$$\|D(K, \mathcal{P})\| < \delta \quad \text{whenever } \mathcal{P}_1 \subseteq \mathcal{P}. \tag{26}$$

By definition of the superdiagonal integral, there is a partition $\mathcal{P}_2$
of $\mathcal{F}$ such that

$$\|A-S(K, \mathcal{P})\| < \delta \quad \text{whenever } \mathcal{P}_2 \subseteq \mathcal{P}. \tag{27}$$

If $\mathcal{P}$ is a partition which contains both $\mathcal{P}_1$ and $\mathcal{P}_2$ then, by (21), (26)
and (27),

$$\|K-A-A^*\| = \|[S(K, \mathcal{P})-A]+[S(K, \mathcal{P})-A]^*+D(K, \mathcal{P})\| < 3\delta.$$

Since this holds for every positive $\delta$, it follows that $K = A+A^*$.
Since $A$ is quasi-nilpotent, $A(M) \subseteq M_- \subseteq M$ whenever $M \in \mathcal{F}$, and
$A+A^* = K$, the operator

$$T = 2iA = 2i\int_{\mathcal{F}} E(M)KE(dM)$$

is quasi-nilpotent, leaves each subspace in $\mathcal{F}$ invariant, and has
skew-adjoint part $K$.

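Conversely, suppose there is a quasi-nilpotent operator $T$ such
that $\operatorname{Im} T = K$ and $T(M) \subseteq M$ for each $M$ in $\mathcal{F}$. By Theorem 4.4.2, $T$
is compact, and $P(M)KP(M) = 0$ whenever $M \in \mathcal{F}$ and $M \ne M_-$. We
may set $T = 2iA$, where $A$ is compact and quasi-nilpotent, $A(M) \subseteq M$
for each $M$ in $\mathcal{F}$, and $A+A^* = K$. By Lemma 4.4.1, $P(M)AP(M) = 0$
whenever $M \in \mathcal{F}$ and $M \ne M_-$. It now follows from Lemma 4.4.4 that,
if $\delta$ is a positive real number, there exist partitions $\mathcal{P}_1$ and $\mathcal{P}_2$ of $\mathcal{F}$
such that $\|D(K, \mathcal{P}_1)\| < \delta$, $\|D(A, \mathcal{P}_2)\| < \delta$. If $\mathcal{P}_3 = \mathcal{P}_1 \cup \mathcal{P}_2$ then,
by (13),

$$\|D(K, \mathcal{P})\| < \delta, \quad \|D(A, \mathcal{P})\| < \delta \tag{28}$$

whenever $\mathcal{P}_3 \subseteq \mathcal{P}$.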

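Suppose that $\mathcal{P} = \{M_0, \ldots, M_n\}$ is a partition of $\mathcal{F}$, and
$\mathcal{Q} = \{L_1, \ldots, L_n\}$ satisfies the usual condition (22). If we write $E_j$
and $F_j$ for the projections $E(M_j)-E(M_{j-1})$ and $E(L_j)-E(M_{j-1})$
respectively, then $0 \le F_j \le E_j$, while $E_1, \ldots, E_n$ are pairwise
orthogonal; so the projection $F = F_1+\cdots+F_n$ satisfies
$FE_j = F_j$ $(j = 1, \ldots, n)$. It follows from (19) and (23) that

$$\begin{aligned}
S(K, \mathcal{P}, \mathcal{Q})-S(K, \mathcal{P}) &= \sum_{j=1}^{n} [E(L_j)-E(M_{j-1})]K[E(M_j)-E(M_{j-1})] \\
&= \sum_{j=1}^{n} F_jKE_j \\
&= F\sum_{j=1}^{n} E_jKE_j = FD(K, \mathcal{P}).
\end{aligned}$$

Thus

$$\|S(K, \mathcal{P}, \mathcal{Q})-S(K, \mathcal{P})\| \le \|D(K, \mathcal{P})\|. \tag{29}$$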


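In order to estimate $\|A-S(K, \mathcal{P})\|$, we first note that

$$E(M_{j-1})A^*[E(M_j)-E(M_{j-1})] = 0 \quad (j = 1, \ldots, n), \tag{30}$$

$$[I-E(M_j)]A[E(M_j)-E(M_{j-1})] = 0 \quad (j = 1, \ldots, n). \tag{31}$$

Indeed, since the subspaces $M_j$ and $M_{j-1}$ are invariant under $A$, the
left-hand side of (30) is the adjoint of

$$[E(M_j)-E(M_{j-1})]E(M_{j-1})AE(M_{j-1}) = 0,$$

while the left-hand side of (31) is

$$[I-E(M_j)]AE(M_j)[E(M_j)-E(M_{j-1})] = [I-E(M_j)]E(M_j)AE(M_j)[E(M_j)-E(M_{j-1})] = 0.$$

Since $K = A+A^*$, it follows from (19), (30) and (31) that

$$\begin{aligned}
S(K, \mathcal{P}) &= \sum_{j=1}^{n} E(M_{j-1})(A+A^*)[E(M_j)-E(M_{j-1})] \\
&= \sum_{j=1}^{n} E(M_{j-1})A[E(M_j)-E(M_{j-1})] \\
&= \sum_{j=1}^{n} [I-E(M_j)+E(M_{j-1})]A[E(M_j)-E(M_{j-1})] \\
&= A-\sum_{j=1}^{n} [E(M_j)-E(M_{j-1})]A[E(M_j)-E(M_{j-1})] \\
&= A-D(A, \mathcal{P}).
\end{aligned}$$

Thus $A-S(K, \mathcal{P}) = D(A, \mathcal{P})$ and, by (29) and (28),

$$\begin{aligned}
\|A-S(K, \mathcal{P}, \mathcal{Q})\| &\le \|A-S(K, \mathcal{P})\|+\|S(K, \mathcal{P})-S(K, \mathcal{P}, \mathcal{Q})\| \\
&\le \|D(A, \mathcal{P})\|+\|D(K, \mathcal{P})\| < 2\delta,
\end{aligned}$$

whenever the partition $\mathcal{P}$ contains $\mathcal{P}_3$. This shows that the
superdiagonal integral of $K$ relative to $\mathcal{F}$ converges, and that its
value is $A$. Since

$$T = 2iA = 2i\int_{\mathcal{F}} E(M)KE(dM),$$

the theorem is proved.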


In view of Theorem 4.4.6, it is desirable to find conditions on $K$
which ensure the convergence of the superdiagonal integral
$\int_{\mathcal{F}} E(M)KE(dM)$. Before proving our first result in this direction,
Theorem 4.4.10, we require some lemmas. As in Chapter 2, $\mathcal{C}_p$
denotes the von Neumann-Schatten class of operators, $\|\ \|_p$ is the
usual norm on $\mathcal{C}_p$ and $\tau$ is the trace on $\mathcal{C}_1$. We recall from §2.4
that the Schmidt class $\mathcal{C}_2$ is a Hilbert space, with inner product
$[\ ,\ ]$ defined by $[S, T] = \tau(T^*S)$.


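LEMMA 4.4.7. Suppose that $\mathcal{F}$ is a simple chain of closed
subspaces of a Hilbert space $\mathcal{H}$, $K$ is a self-adjoint Schmidt class
operator on $\mathcal{H}$, $\mathcal{P} = \{M_0, \ldots, M_n\}$ is a partition of $\mathcal{F}$ and
$\mathcal{Q} = \{L_1, \ldots, L_n\}$ satisfies the usual condition (22). Then the
operators $D(K, \mathcal{P})$, $S(K, \mathcal{P})$ and $S(K, \mathcal{P}, \mathcal{Q})$, defined by (11), (19) and
(23), all lie in the Schmidt class, and

(i) $\|S(K, \mathcal{P}, \mathcal{Q})-S(K, \mathcal{P})\|_2 \le \|D(K, \mathcal{P})\|_2$,

(ii) $\|D(K, \mathcal{P})\|_2^2+\|K-D(K, \mathcal{P})\|_2^2 = \|K\|_2^2$.

Proof. Since $\mathcal{C}_2$ is an ideal in $\mathcal{B}(\mathcal{H})$ and $K \in \mathcal{C}_2$, it is apparent
that $D(K, \mathcal{P})$, $S(K, \mathcal{P})$, $S(K, \mathcal{P}, \mathcal{Q}) \in \mathcal{C}_2$. We have seen, in proving
Theorem 4.4.6, that

$$S(K, \mathcal{P}, \mathcal{Q})-S(K, \mathcal{P}) = FD(K, \mathcal{P}),$$

where $F$ is the projection $\sum_{j=1}^{n} [E(L_j)-E(M_{j-1})]$; thus

$$\|S(K, \mathcal{P}, \mathcal{Q})-S(K, \mathcal{P})\|_2 \le \|F\|\,\|D(K, \mathcal{P})\|_2 \le \|D(K, \mathcal{P})\|_2.$$

If we write $E_j$ for the projection $E(M_j)-E(M_{j-1})$, then $E_1, \ldots, E_n$
are pairwise orthogonal and have sum $I$. Note that

$$\begin{aligned}
[E_rKE_s, E_pKE_q] &= \tau((E_pKE_q)^*E_rKE_s) \\
&= \tau(E_qKE_pE_rKE_s) \\
&= \tau(KE_pE_rKE_sE_q),
\end{aligned}$$

and the right-hand side is 0 unless $p = r$ and $q = s$. Hence the
operators $\{E_rKE_s : r, s = 1, \ldots, n\}$ are pairwise orthogonal vectors
in the Hilbert space $\mathcal{C}_2$. It follows that the vectors

$$D(K, \mathcal{P}) = \sum_{r=1}^{n} E_rKE_r, \qquad K-D(K, \mathcal{P}) = \sum_{r \ne s} E_rKE_s$$

in $\mathcal{C}_2$ are orthogonal, and therefore

$$\|K\|_2^2 = \|D(K, \mathcal{P})+K-D(K, \mathcal{P})\|_2^2 = \|D(K, \mathcal{P})\|_2^2+\|K-D(K, \mathcal{P})\|_2^2.$$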
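Both parts of Lemma 4.4.7 have an elementary finite-dimensional analogue, which the following sketch checks with exact integer arithmetic. The setting is an assumption made only for illustration: the space is $\mathbb{R}^4$ with the chain of coordinate subspaces, the Schmidt norm becomes the Frobenius norm, a partition is a list of cut points, and the intermediate subspaces $L_j$ of (22) are encoded by a hypothetical list `mids` with `cuts[j-1] <= mids[j-1] <= cuts[j]`.

```python
# Finite-dimensional sketch (assumption: H = R^4, coordinate chain,
# Schmidt norm = Frobenius norm). All helper names are illustrative.

def block_index(cuts, a):
    """Return j with cuts[j-1] <= a < cuts[j] for a 0-based coordinate a."""
    for j in range(1, len(cuts)):
        if a < cuts[j]:
            return j
    raise ValueError("coordinate outside the space")

def diagonal_part(K, cuts):
    """D(K, P): the block-diagonal part of K."""
    n = len(K)
    return [[K[a][b] if block_index(cuts, a) == block_index(cuts, b) else 0
             for b in range(n)] for a in range(n)]

def riemann_sum(K, cuts, mids):
    """S(K, P, Q): entry (a, b) survives when row a lies in L_j, j = block of b."""
    n = len(K)
    return [[K[a][b] if a < mids[block_index(cuts, b) - 1] else 0
             for b in range(n)] for a in range(n)]

def sub(A, B):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(A, B)]

def schmidt_norm_sq(A):
    """Square of the Frobenius (Schmidt) norm."""
    return sum(x * x for row in A for x in row)

K = [[2, 1, 0, 3],
     [1, 5, 4, 0],
     [0, 4, 1, 2],
     [3, 0, 2, 7]]          # real symmetric
cuts = [0, 1, 3, 4]         # partition P
mids = [1, 2, 4]            # collection Q, one cut per block of P

D = diagonal_part(K, cuts)
S_P = riemann_sum(K, cuts, cuts[:-1])    # L_j = M_{j-1}: superdiagonal part
S_PQ = riemann_sum(K, cuts, mids)

part_i = schmidt_norm_sq(sub(S_PQ, S_P)) <= schmidt_norm_sq(D)
part_ii = schmidt_norm_sq(D) + schmidt_norm_sq(sub(K, D)) == schmidt_norm_sq(K)
print(part_i, part_ii)
```

In this model the difference `S_PQ - S_P` consists of some of the rows of `D`, which is the matrix form of the identity $S(K,\mathcal{P},\mathcal{Q})-S(K,\mathcal{P}) = FD(K,\mathcal{P})$ used in the proof.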
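LEMMA 4.4.8. Suppose that $\mathcal{F}$ is a simple chain of closed
subspaces of a Hilbert space $\mathcal{H}$, $K$ is a self-adjoint Schmidt class
operator acting on $\mathcal{H}$, $P(M)KP(M) = 0$ whenever $M \in \mathcal{F}$ and $M \ne M_-$,
and $\delta$ is a positive real number. Then there is a partition $\mathcal{P}_0$ of $\mathcal{F}$
with the following property: if $\mathcal{P}$ is a partition of $\mathcal{F}$ which contains
$\mathcal{P}_0$, then $\|D(K, \mathcal{P})\|_2 < \delta$.

Proof. If $\{\varphi_j : j \in J\}$ is an orthonormal basis of $\mathcal{H}$, then

$$\|K\|_2^2 = \sum_{j,k \in J} |\langle K\varphi_j, \varphi_k\rangle|^2,$$

so there is a finite subset $F$ of $J$ such that

$$\|K\|_2^2 < \sum_{j,k \in F} |\langle K\varphi_j, \varphi_k\rangle|^2+\tfrac12\delta^2.$$

Given any partition $\mathcal{P}$ of $\mathcal{F}$,

$$\begin{aligned}
\|K-D(K, \mathcal{P})\|_2^2 &\ge \sum_{j,k \in F} |\langle K\varphi_j, \varphi_k\rangle-\langle D(K, \mathcal{P})\varphi_j, \varphi_k\rangle|^2 \\
&= \sum_{j,k \in F} \bigl\{|\langle K\varphi_j, \varphi_k\rangle|^2+|\langle D(K, \mathcal{P})\varphi_j, \varphi_k\rangle|^2 \\
&\qquad\qquad -2\operatorname{Re}\bigl[\langle K\varphi_j, \varphi_k\rangle\,\overline{\langle D(K, \mathcal{P})\varphi_j, \varphi_k\rangle}\bigr]\bigr\} \\
&\ge \sum_{j,k \in F} \bigl\{|\langle K\varphi_j, \varphi_k\rangle|^2-2\|K\|\,\|D(K, \mathcal{P})\|\bigr\} \\
&> \|K\|_2^2-\tfrac12\delta^2-2m^2\|K\|\,\|D(K, \mathcal{P})\|,
\end{aligned}$$

where $m$ is the number of elements in $F$. By Lemma 4.4.7 (ii) the
above inequality is equivalent to

$$\|D(K, \mathcal{P})\|_2^2 < \tfrac12\delta^2+2m^2\|K\|\,\|D(K, \mathcal{P})\|. \tag{32}$$

Let $\eta$ be a positive real number such that $2m^2\|K\|\eta < \tfrac12\delta^2$. By
Lemma 4.4.4, there is a partition $\mathcal{P}_0$ of $\mathcal{F}$ such that $\|D(K, \mathcal{P}_0)\| < \eta$.
If $\mathcal{P}$ is any partition of $\mathcal{F}$ which contains $\mathcal{P}_0$, then $\|D(K, \mathcal{P})\| < \eta$ by
(13); and it then follows from (32) that

$$\|D(K, \mathcal{P})\|_2^2 < \tfrac12\delta^2+2m^2\|K\|\eta < \delta^2.$$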


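LEMMA 4.4.9. Suppose that $\mathcal{F}$ is a simple chain of closed
subspaces of a Hilbert space $\mathcal{H}$, $K$ is a self-adjoint Schmidt class
operator on $\mathcal{H}$, and $\mathcal{P}_1$, $\mathcal{P}_2$ are partitions of $\mathcal{F}$. Then

$$2\|S(K, \mathcal{P}_1)-S(K, \mathcal{P}_2)\|_2^2 = \|D(K, \mathcal{P}_1)-D(K, \mathcal{P}_2)\|_2^2.$$

Proof. By (21),

$$D(K, \mathcal{P}_r) = K-S(K, \mathcal{P}_r)-S(K, \mathcal{P}_r)^* \quad (r = 1, 2),$$

so

$$D(K, \mathcal{P}_1)-D(K, \mathcal{P}_2) = \{S(K, \mathcal{P}_2)-S(K, \mathcal{P}_1)\}+\{S(K, \mathcal{P}_2)^*-S(K, \mathcal{P}_1)^*\}.$$

If we show that the two operators on the right-hand side of the last
equation are orthogonal vectors in the Hilbert space $\mathcal{C}_2$, it follows
that

$$\|D(K, \mathcal{P}_1)-D(K, \mathcal{P}_2)\|_2^2 = \|S(K, \mathcal{P}_2)-S(K, \mathcal{P}_1)\|_2^2+\|S(K, \mathcal{P}_2)^*-S(K, \mathcal{P}_1)^*\|_2^2,$$

and we obtain the required result, since $\|T^*\|_2 = \|T\|_2$ when
$T \in \mathcal{C}_2$. It is therefore sufficient to prove that

$$[S(K, \mathcal{P}_r), S(K, \mathcal{P}_s)^*] = 0 \quad (r, s = 1, 2);$$

that is, $\tau(S(K, \mathcal{P}_s)S(K, \mathcal{P}_r)) = 0$ $(r, s = 1, 2)$.

Suppose that $\mathcal{P}_r$ consists of the subspaces

$$\{0\} = M_0 \subsetneq M_1 \subsetneq \cdots \subsetneq M_n = \mathcal{H}$$

(in $\mathcal{F}$). It follows at once from the definition (19) of $S(K, \mathcal{P})$ that
$S(K, \mathcal{P}_r)$ maps $M_j$ into $M_{j-1}$ $(j = 1, \ldots, n)$; moreover, by (20), $S(K, \mathcal{P}_s)$
leaves invariant each subspace in $\mathcal{F}$. Hence the operator
$T = S(K, \mathcal{P}_s)S(K, \mathcal{P}_r)$ satisfies

$$T(M_j) \subseteq M_{j-1} \quad (j = 1, \ldots, n). \tag{33}$$

The range of the projection $E_j = E(M_j)-E(M_{j-1})$ is contained in $M_j$,
so (33) implies that $TE_j = E(M_{j-1})TE_j$. It follows that

$$T = \sum_{j=1}^{n} TE_j = \sum_{j=1}^{n} E(M_{j-1})TE_j$$

and, since $E_jE(M_{j-1}) = 0$,

$$\begin{aligned}
\tau(S(K, \mathcal{P}_s)S(K, \mathcal{P}_r)) &= \tau(T) \\
&= \sum_{j=1}^{n} \tau(E(M_{j-1})TE_j) \\
&= \sum_{j=1}^{n} \tau(E_jE(M_{j-1})T) = 0.
\end{aligned}$$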

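THEOREM 4.4.10. Suppose that $\mathcal{F}$ is a simple chain of closed
subspaces of a Hilbert space $\mathcal{H}$, $K$ is a self-adjoint Schmidt class
operator acting on $\mathcal{H}$, and $P(M)KP(M) = 0$ whenever $M \in \mathcal{F}$ and $M \ne M_-$.
Then the superdiagonal integral of $K$ relative to $\mathcal{F}$ converges; the
operator

$$A = \int_{\mathcal{F}} E(M)KE(dM)$$

lies in the Schmidt class $\mathcal{C}_2$, and the approximating Riemann-Stieltjes
sums $S(K, \mathcal{P}, \mathcal{Q})$ converge to $A$ with respect to the norm
$\|\ \|_2$ on $\mathcal{C}_2$.

Proof. Suppose that, for $r = 1, 2$, $\mathcal{P}_r$ is a partition of $\mathcal{F}$, and
$S(K, \mathcal{P}_r, \mathcal{Q}_r)$ is a Riemann-Stieltjes sum of the type (23); of course,
we assume that $\mathcal{P}_r$ and $\mathcal{Q}_r$ are related as in (18) and (22). By
Lemmas 4.4.7(i) and 4.4.9,

$$\begin{aligned}
\|S(K, \mathcal{P}_1, \mathcal{Q}_1)-S(K, \mathcal{P}_2, \mathcal{Q}_2)\|_2
&\le \|S(K, \mathcal{P}_1, \mathcal{Q}_1)-S(K, \mathcal{P}_1)\|_2+\|S(K, \mathcal{P}_1)-S(K, \mathcal{P}_2)\|_2 \\
&\qquad +\|S(K, \mathcal{P}_2)-S(K, \mathcal{P}_2, \mathcal{Q}_2)\|_2 \\
&\le \|D(K, \mathcal{P}_1)\|_2+2^{-1/2}\|D(K, \mathcal{P}_1)-D(K, \mathcal{P}_2)\|_2+\|D(K, \mathcal{P}_2)\|_2.
\end{aligned}$$

Thus

$$\|S(K, \mathcal{P}_1, \mathcal{Q}_1)-S(K, \mathcal{P}_2, \mathcal{Q}_2)\|_2 \le 2\{\|D(K, \mathcal{P}_1)\|_2+\|D(K, \mathcal{P}_2)\|_2\}. \tag{34}$$

Given a positive real number $\delta$, let $\mathcal{P}_0$ be a partition of $\mathcal{F}$ which
satisfies the conclusions of Lemma 4.4.8. If $\mathcal{P}_1$ and $\mathcal{P}_2$ both contain
$\mathcal{P}_0$, then $\|D(K, \mathcal{P}_r)\|_2 < \delta$ $(r = 1, 2)$; and (34) gives

$$\|S(K, \mathcal{P}_1, \mathcal{Q}_1)-S(K, \mathcal{P}_2, \mathcal{Q}_2)\|_2 < 4\delta \quad \text{if } \mathcal{P}_0 \subseteq \mathcal{P}_1,\ \mathcal{P}_0 \subseteq \mathcal{P}_2. \tag{35}$$

The set of all pairs $(\mathcal{P}, \mathcal{Q})$ is directed by the inclusion relation
(applied to $\mathcal{P}$) and the last inequality shows that the Riemann-Stieltjes
sums $S(K, \mathcal{P}, \mathcal{Q})$ form a Cauchy net in the Hilbert space $\mathcal{C}_2$.
Accordingly, this net converges to an element $A$ of $\mathcal{C}_2$, with respect
to $\|\ \|_2$ (hence, also, with respect to the norm on $\mathcal{B}(\mathcal{H})$).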


In Theorem 4.5.7, we shall prove the convergence of the 
superdiagonal integral under conditions much less restrictive than 


those assumed in Theorem 4.4.10. 


COROLLARY 4.4.11. If $T$ is a quasi-nilpotent operator acting
on a Hilbert space $\mathcal{H}$, and $\operatorname{Im} T$ lies in the Schmidt class $\mathcal{C}_2$, then
$T \in \mathcal{C}_2$.

Proof. Since $\operatorname{Im} T$ is compact, it follows from Theorem 1.8.8
that $T$ is compact. By Theorems 4.3.4 and 4.4.6, we deduce that
there is a simple chain $\mathcal{F}$ of closed subspaces of $\mathcal{H}$, each of which
is invariant under $T$; and

$$T = 2i\int_{\mathcal{F}} E(M)KE(dM),$$

where $K$ is the Schmidt class operator $\operatorname{Im} T$. The convergence of the
superdiagonal integral implies that $P(M)KP(M) = 0$ whenever $M \in \mathcal{F}$
and $M \ne M_-$, by Lemma 4.4.5, so the required result follows from
Theorem 4.4.10.


4.5. The operator of integration in $L_2(0,1)$

This section is concerned with the operator $V$, acting on the
Hilbert space $L_2(0,1)$ (Lebesgue measure), which is defined by the
equation

$$(Vx)(s) = i\int_0^s x(t)\,dt \quad (x \in L_2(0,1);\ 0 \le s \le 1). \tag{1}$$

Since $V$ is a Volterra integral operator, with kernel $h$ defined by

$$h(s, t) = \begin{cases} i & (0 \le t \le s \le 1), \\ 0 & (0 \le s < t \le 1), \end{cases}$$

the discussion in §4.3.12 shows that $V$ is quasi-nilpotent. The
adjoint operator $V^*$ arises from the kernel $\overline{h(t, s)}$, and is therefore
given by

$$(V^*x)(s) = -i\int_s^1 x(t)\,dt. \tag{2}$$

Hence

$$((V-V^*)x)(s) = i\int_0^1 x(t)\,dt = i\langle x, e\rangle e(s),$$

where $e$ is the unit vector in $L_2(0,1)$ defined by

$$e(s) = 1 \quad (0 \le s \le 1). \tag{3}$$

Thus

$$(V-V^*)x = i\langle x, e\rangle e \quad (x \in L_2(0,1)). \tag{4}$$

It is readily verified that

$$(V^ne)(s) = (is)^n/n! \quad (n = 0, 1, 2, \ldots;\ 0 \le s \le 1).$$

Since polynomials form an everywhere dense subspace of $L_2(0,1)$,

$$\mathcal{L}(e, Ve, V^2e, \ldots) = L_2(0,1); \tag{5}$$

here, as in §4.2, $\mathcal{L}(x_1, x_2, \ldots)$ denotes the closed subspace
generated by $x_1, x_2, \ldots$. Finally, we recall from §4.3.12 that, if

$$L_\lambda = \{x \in L_2(0,1) : x(s) = 0 \text{ almost everywhere on } [0, \lambda]\} \tag{6}$$

when $0 \le \lambda \le 1$, then $\mathcal{F} = \{L_\lambda : 0 \le \lambda \le 1\}$ is a continuous chain of
closed subspaces of $L_2(0,1)$, each of which is invariant under $V$.

The preceding discussion shows that $V$ is an example of a linear
operator $T$, acting on a Hilbert space $\mathcal{H}$, which has the following
properties:

(a) $T$ is quasi-nilpotent,

(b) there is a unit vector $e$ in $\mathcal{H}$ such that

$$(T-T^*)x = i\langle x, e\rangle e \quad (x \in \mathcal{H}),$$

(c) $\mathcal{L}(e, Te, T^2e, \ldots) = \mathcal{H}$.

We shall prove, in Theorems 4.5.3 and 4.5.4, that every such
operator $T$ is unitarily equivalent to $V$, and that the subspaces
$L_\lambda$ $(0 \le \lambda \le 1)$ are the only closed invariant subspaces of $V$. These
results are then used to obtain a condition, much more general than
that of Theorem 4.4.10, which ensures the convergence of a
superdiagonal integral.
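The formula $(V^ne)(s) = (is)^n/n!$ for the operator of integration can be confirmed symbolically on polynomials. The following sketch is illustrative only: it assumes $V$ is given by $(Vx)(s) = i\int_0^s x(t)\,dt$, represents a polynomial by the list of its coefficients, stores each complex coefficient as a pair of exact fractions, and uses a hypothetical helper name `apply_V`.

```python
from fractions import Fraction

def apply_V(coeffs):
    """Apply (Vx)(s) = i * integral_0^s x(t) dt to a polynomial.

    coeffs[k] is the pair (re, im) of the coefficient of s**k.
    """
    out = [(Fraction(0), Fraction(0))]          # V x has no constant term
    for k, (re, im) in enumerate(coeffs):
        # i * (re + i*im) / (k + 1) = (-im + i*re) / (k + 1)
        out.append((-im / (k + 1), re / (k + 1)))
    return out

e = [(Fraction(1), Fraction(0))]                # the constant function 1
x1 = apply_V(e)                                 # V e
x2 = apply_V(x1)                                # V^2 e
x3 = apply_V(x2)                                # V^3 e

print(x1[1], x2[2], x3[3])
```

Each application of `apply_V` shifts the single non-zero coefficient one degree up, multiplies it by $i$ and divides by the new degree, which is exactly the recursion behind $(is)^n/n!$.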
Suppose that $T$ is a bounded linear operator acting on a Hilbert
space $\mathcal{H}$, and conditions (a), (b) and (c) above are satisfied. In
view of (b), the skew-adjoint part $K$ of $T$ is given by

$$Kx = \tfrac12\langle x, e\rangle e \quad (x \in \mathcal{H}). \tag{7}$$

Since $K$ is compact, it follows from Theorem 1.8.8 that $T$ is compact,
so there is a simple chain $\mathcal{F}$ of closed subspaces of $\mathcal{H}$, each of
which is invariant under $T$. By Theorem 4.4.6,

$$T = 2i\int_{\mathcal{F}} E(M)KE(dM). \tag{8}$$


LEMMA 4.5.1. Suppose that $\mathcal{H}$ is a Hilbert space, $T \in \mathcal{B}(\mathcal{H})$,
and $T$ satisfies conditions (a), (b) and (c). Let $\mathcal{F}$ be a simple chain
of closed subspaces of $\mathcal{H}$, each of which is invariant under $T$. Then

(i) the closed subspace generated by the vectors $E(M)e$ $(M \in \mathcal{F})$
is $\mathcal{H}$;

(ii) $\mathcal{F}$ is a continuous chain;

(iii) if $\varphi(M) = \langle E(M)e, e\rangle$ $(M \in \mathcal{F})$, then $\varphi$ is a one-to-one
mapping from $\mathcal{F}$ onto $[0,1]$, and $\varphi(M_1) \le \varphi(M_2)$ whenever $M_1, M_2 \in \mathcal{F}$
and $M_1 \subseteq M_2$.

Proof. (i) Let $X$ be the closed subspace generated by the
vectors $E(M)e$ $(M \in \mathcal{F})$. If $x \in \mathcal{H}$ and $S(K, \mathcal{P}, \mathcal{Q})$ is a Riemann-Stieltjes
sum of the type defined in 4.4(23), it follows from (7) that

$$\begin{aligned}
S(K, \mathcal{P}, \mathcal{Q})x &= \sum_{j=1}^{n} E(L_j)K[E(M_j)-E(M_{j-1})]x \\
&= \tfrac12\sum_{j=1}^{n} \langle [E(M_j)-E(M_{j-1})]x, e\rangle E(L_j)e \in X.
\end{aligned}$$

By (8), $T$ is the limit, in norm, of operators $2iS(K, \mathcal{P}, \mathcal{Q})$; so $Tx$ is a
norm limit of vectors $2iS(K, \mathcal{P}, \mathcal{Q})x$ in $X$, and therefore $Tx \in X$.
Since $e = E(\mathcal{H})e \in X$, and $X$ contains $Te, T^2e, \ldots$ by the argument
of the preceding paragraph, it follows from property (c) that $X = \mathcal{H}$.

(ii), (iii) For each $M$ in $\mathcal{F}$, $\varphi(M) = \langle E(M)e, e\rangle = \|E(M)e\|^2$, so
$0 \le \varphi(M) \le \|e\|^2 = 1$. Suppose that $M_1, M_2 \in \mathcal{F}$ and $M_1 \subseteq M_2$. Then

$$\varphi(M_2)-\varphi(M_1) = \langle [E(M_2)-E(M_1)]e, e\rangle = \|[E(M_2)-E(M_1)]e\|^2 \ge 0.$$

If $\varphi(M_1) = \varphi(M_2)$ then $E(M_1)e = E(M_2)e$ and, for each $M$ in $\mathcal{F}$,

$$E(M_1)E(M)e = E(M)E(M_1)e = E(M)E(M_2)e = E(M_2)E(M)e.$$

It now follows from part (i) of the lemma that $E(M_1) = E(M_2)$, so
$M_1 = M_2$. This proves that $\varphi$ is a one-to-one order preserving
mapping from $\mathcal{F}$ into $[0,1]$.

Suppose that $M \in \mathcal{F}$. If $M \ne M_-$ then $P(M)KP(M) = 0$ by Theorem
4.4.2, and it follows from (7) that

$$0 = P(M)KP(M)e = \tfrac12\langle P(M)e, e\rangle P(M)e = \tfrac12\|P(M)e\|^2P(M)e.$$

Thus $P(M)e = 0$, and

$$\varphi(M)-\varphi(M_-) = \langle [E(M)-E(M_-)]e, e\rangle = \langle P(M)e, e\rangle = 0,$$

contradicting our earlier conclusion that $\varphi$ is one-to-one. Thus
$M = M_-$ for each $M$ in $\mathcal{F}$; that is, $\mathcal{F}$ is a continuous chain.

It remains to prove that $\varphi$ maps $\mathcal{F}$ onto $[0,1]$. Now $\varphi(\{0\}) = 0$ and
$\varphi(\mathcal{H}) = 1$. If $0 < t < 1$, let

$$M = \bigcap\{L : L \in \mathcal{F},\ \|E(L)e\|^2 > t\}. \tag{9}$$

Then

$$E(M) = \bigwedge\{E(L) : L \in \mathcal{F},\ \|E(L)e\|^2 > t\}$$

and $M \in \mathcal{F}$ since $\mathcal{F}$ is a simple chain. By Theorem 1.7.11, $E(M)$ lies
in the strong closure of $\{E(L) : L \in \mathcal{F},\ \|E(L)e\|^2 > t\}$; since the
mapping $A \to \|Ae\|^2$ is continuous for the strong topology on $\mathcal{B}(\mathcal{H})$,
it follows that $\|E(M)e\|^2 \ge t$. Hence $M \ne \{0\}$ and, since $\mathcal{F}$ is
continuous,

$$E(M) = E(M_-) = \bigvee\{E(L) : L \in \mathcal{F},\ L \subsetneq M\}.$$

Thus $E(M)$ lies in the strong closure of $\{E(L) : L \in \mathcal{F},\ L \subsetneq M\}$. It
follows from (9) that $\|E(L)e\|^2 \le t$ whenever $L \in \mathcal{F}$ and $L \subsetneq M$; so
the strong continuity of the mapping $A \to \|Ae\|^2$ now implies that
$\|E(M)e\|^2 \le t$. Since the reverse inequality has already been
established, $\|E(M)e\|^2 = t$; that is, $\varphi(M) = t$.


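In proving our next lemma, we shall make use of some simple
properties of an isometric isomorphism $U$ from a Hilbert space $\mathcal{H}_1$ onto
another such space $\mathcal{H}_2$. Since $\langle x, x\rangle = \langle Ux, Ux\rangle$ for each $x$ in $\mathcal{H}_1$,
it follows immediately from the identity 1.7(2) that
$\langle x, y\rangle = \langle Ux, Uy\rangle$ whenever $x, y \in \mathcal{H}_1$. Furthermore, if $T \in \mathcal{B}(\mathcal{H}_1)$
and $x, y \in \mathcal{H}_1$, then

$$\begin{aligned}
\langle (UTU^{-1})^*Ux, Uy\rangle &= \langle Ux, UTU^{-1}Uy\rangle \\
&= \langle Ux, UTy\rangle \\
&= \langle x, Ty\rangle \\
&= \langle T^*x, y\rangle \\
&= \langle UT^*x, Uy\rangle \\
&= \langle (UT^*U^{-1})Ux, Uy\rangle;
\end{aligned}$$

so $(UTU^{-1})^* = UT^*U^{-1}$.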


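LEMMA 4.5.2. Suppose that, for $r = 1, 2$, $\mathcal{H}_r$ is a Hilbert space,
$T_r \in \mathcal{B}(\mathcal{H}_r)$, and conditions (a), (b) and (c) are satisfied with
$T_r$, $\mathcal{H}_r$, $e_r$ in place of $T$, $\mathcal{H}$, $e$. Let $\mathcal{F}_r$ be a simple chain of closed
subspaces of $\mathcal{H}_r$, each of which is invariant under $T_r$. Then there
is an isometric isomorphism $U$ from $\mathcal{H}_1$ onto $\mathcal{H}_2$ such that

$$Ue_1 = e_2, \qquad UT_1U^{-1} = T_2, \qquad \{U(M) : M \in \mathcal{F}_1\} = \mathcal{F}_2.$$

Proof. By Lemma 4.5.1 (iii), the equation $\varphi_r(M) = \langle E(M)e_r, e_r\rangle$
defines a one-to-one mapping $\varphi_r$ from $\mathcal{F}_r$ onto $[0,1]$, and $\varphi_r(M) \le \varphi_r(N)$
if $M, N \in \mathcal{F}_r$ and $M \subseteq N$. The composite mapping $\varphi = \varphi_2^{-1} \circ \varphi_1$ is
one-to-one, from $\mathcal{F}_1$ onto $\mathcal{F}_2$, and $\varphi(M) \subseteq \varphi(N)$ whenever $M, N \in \mathcal{F}_1$
and $M \subseteq N$; thus $\varphi(\{0\}) = \{0\}$, $\varphi(\mathcal{H}_1) = \mathcal{H}_2$, and

$$\varphi(M \cap N) = \varphi(M) \cap \varphi(N) \quad (M, N \in \mathcal{F}_1).$$

Furthermore

$$\langle E(\varphi(M))e_2, e_2\rangle = \varphi_2(\varphi(M)) = \varphi_1(M) = \langle E(M)e_1, e_1\rangle \quad (M \in \mathcal{F}_1).$$

It follows that, if $M, N \in \mathcal{F}_1$, then

$$\begin{aligned}
\langle E(\varphi(M))e_2, E(\varphi(N))e_2\rangle &= \langle E(\varphi(N))E(\varphi(M))e_2, e_2\rangle \\
&= \langle E(\varphi(N) \cap \varphi(M))e_2, e_2\rangle \\
&= \langle E(\varphi(N \cap M))e_2, e_2\rangle \\
&= \langle E(N \cap M)e_1, e_1\rangle \\
&= \langle E(N)E(M)e_1, e_1\rangle \\
&= \langle E(M)e_1, E(N)e_1\rangle.
\end{aligned}$$

By linearity of the inner products, if $a_j$, $b_k$ are scalars and
$M_j, N_k \in \mathcal{F}_1$ $(1 \le j \le m,\ 1 \le k \le n)$, then

$$\Bigl\langle \sum_{j=1}^{m} a_jE(\varphi(M_j))e_2, \sum_{k=1}^{n} b_kE(\varphi(N_k))e_2\Bigr\rangle
= \Bigl\langle \sum_{j=1}^{m} a_jE(M_j)e_1, \sum_{k=1}^{n} b_kE(N_k)e_1\Bigr\rangle.$$

In particular

$$\Bigl\|\sum_{j=1}^{m} a_jE(\varphi(M_j))e_2\Bigr\| = \Bigl\|\sum_{j=1}^{m} a_jE(M_j)e_1\Bigr\|. \tag{10}$$

For $r = 1, 2$, let $X_r$ denote the set of all finite linear
combinations of vectors of the form $E(M)e_r$, with $M$ in $\mathcal{F}_r$; by Lemma
4.5.1 (i), $X_r$ is an everywhere dense subspace of $\mathcal{H}_r$. If $x \in X_1$, and
$x$ is represented in two ways,

$$x = \sum_{j=1}^{m} a_jE(M_j)e_1 = \sum_{k=1}^{n} b_kE(N_k)e_1,$$

as a linear combination of vectors $E(M)e_1$ $(M \in \mathcal{F}_1)$, then by (10),

$$\begin{aligned}
\Bigl\|\sum_{j=1}^{m} a_jE(\varphi(M_j))e_2-\sum_{k=1}^{n} b_kE(\varphi(N_k))e_2\Bigr\|
&= \Bigl\|\sum_{j=1}^{m} a_jE(M_j)e_1-\sum_{k=1}^{n} b_kE(N_k)e_1\Bigr\| \\
&= \|x-x\| = 0;
\end{aligned}$$

thus

$$\sum_{j=1}^{m} a_jE(\varphi(M_j))e_2 = \sum_{k=1}^{n} b_kE(\varphi(N_k))e_2.$$

It follows that the equation

$$U\Bigl(\sum_{j=1}^{m} a_jE(M_j)e_1\Bigr) = \sum_{j=1}^{m} a_jE(\varphi(M_j))e_2$$

defines (unambiguously) an isometric isomorphism $U$ from $X_1$ onto
$X_2$; this extends by continuity to an isometric isomorphism, which
we denote by the same symbol $U$, from $\mathcal{H}_1$ onto $\mathcal{H}_2$.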

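Since $\varphi(\mathcal{H}_1) = \mathcal{H}_2$ and $e_r = E(\mathcal{H}_r)e_r$, it follows that $Ue_1 = e_2$.
When $M, N \in \mathcal{F}_1$,

$$\begin{aligned}
UE(M)E(N)e_1 &= UE(M \cap N)e_1 \\
&= E(\varphi(M \cap N))e_2 \\
&= E(\varphi(M) \cap \varphi(N))e_2 \\
&= E(\varphi(M))E(\varphi(N))e_2 \\
&= E(\varphi(M))UE(N)e_1.
\end{aligned}$$

By linearity, $UE(M)x = E(\varphi(M))Ux$ whenever $x \in X_1$; since $X_1$ is
dense in $\mathcal{H}_1$, it follows that $UE(M) = E(\varphi(M))U$. Thus
$E(\varphi(M)) = UE(M)U^{-1}$, and $\varphi(M) = U(M)$, for each $M$ in $\mathcal{F}_1$; so

$$\mathcal{F}_2 = \{\varphi(M) : M \in \mathcal{F}_1\} = \{U(M) : M \in \mathcal{F}_1\}.$$

If $T_3 = UT_1U^{-1}$, then $T_3$ is a quasi-nilpotent operator on $\mathcal{H}_2$;
and we have to show that $T_3 = T_2$. If $N \in \mathcal{F}_2$, then $N = U(M)$ for
some $M$ in $\mathcal{F}_1$; thus

$$T_3(N) = UT_1U^{-1}U(M) = UT_1(M) \subseteq U(M) = N.$$

Since $T_1$ and $T_2$ both have property (b),

$$\begin{aligned}
(T_3-T_3^*)x &= U(T_1-T_1^*)U^{-1}x \\
&= i\langle U^{-1}x, e_1\rangle Ue_1 \\
&= i\langle x, Ue_1\rangle Ue_1 \\
&= i\langle x, e_2\rangle e_2 = (T_2-T_2^*)x,
\end{aligned}$$

for each $x$ in $\mathcal{H}_2$. Thus $T_2$ and $T_3$ are both quasi-nilpotent operators
which leave invariant each subspace in $\mathcal{F}_2$, and they have the same
(compact) skew-adjoint part. By the uniqueness clause in Theorem
4.4.2,

$$T_2 = T_3 = UT_1U^{-1}.$$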
THEOREM 4.5.3. Suppose that $V$ is the bounded linear
operator, acting on the space $L_2(0,1)$, which is defined by equation
(1); and let $T$ be a bounded linear operator which acts on a Hilbert
space $\mathcal{H}$ and satisfies conditions (a), (b) and (c) above. Then there
is an isometric isomorphism $U$ from $L_2(0,1)$ onto $\mathcal{H}$ such that
$T = UVU^{-1}$.

Proof. This follows immediately from Lemma 4.5.2, by taking
$\mathcal{H}_1 = L_2(0,1)$, $T_1 = V$, $\mathcal{H}_2 = \mathcal{H}$, $T_2 = T$.


THEOREM 4.5.4. Suppose that the bounded linear operator $V$
acting on the space $L_2(0,1)$, and the closed subspaces $L_\lambda$ $(0 \le \lambda \le 1)$
of $L_2(0,1)$, are defined by equations (1) and (6). Then, if $M$ is a
closed subspace of $L_2(0,1)$ which is invariant under $V$, there exists
a real number $\lambda$ such that $0 \le \lambda \le 1$ and $M = L_\lambda$.

Proof. We have seen, in §4.3.12, that the family $\{L_\lambda : 0 \le \lambda \le 1\}$
is a simple chain $\mathcal{F}_1$ of closed subspaces of $L_2(0,1)$; and each $L_\lambda$ is
invariant under $V$. Suppose that $M$ is a closed subspace of $L_2(0,1)$
and $V(M) \subseteq M$. The family consisting of the single subspace $M$ is an
invariant chain for $V$, and is therefore contained in a maximal
invariant chain $\mathcal{F}_2$, which is simple by Lemma 4.3.2.

We now apply Lemma 4.5.2, with $\mathcal{H}_1 = \mathcal{H}_2 = L_2(0,1)$, $T_1 = T_2 = V$
and $e_1 = e_2 = e$, the unit vector defined by (3). We conclude that
there is a unitary operator $U$ on $L_2(0,1)$ for which

$$Ue = e, \qquad UVU^{-1} = V, \qquad \{U(L_\lambda) : 0 \le \lambda \le 1\} = \mathcal{F}_2.$$

These conditions imply that $UV^nU^{-1} = V^n$, and hence that
$UV^ne = V^nUe = V^ne$ $(n = 0, 1, 2, \ldots)$. Since finite linear
combinations of the vectors $V^ne$ $(n = 0, 1, 2, \ldots)$ are everywhere
dense in $L_2(0,1)$, it follows that $U = I$. Thus

$$M \in \mathcal{F}_2 = \{U(L_\lambda) : 0 \le \lambda \le 1\} = \{L_\lambda : 0 \le \lambda \le 1\}.$$

The following lemma, which is a consequence of Theorem 4.5.3,
will be needed in our subsequent work on the convergence of
superdiagonal integrals.

LEMMA 4.5.5. Suppose that $T$ is a quasi-nilpotent operator
acting on a Hilbert space $\mathcal{H}$, and there is a unit vector $f$ in $\mathcal{H}$ such
that

$$(T-T^*)x = i\langle x, f\rangle f \quad (x \in \mathcal{H}).$$

Then $T$ is compact, and the non-zero eigenvalues of $T+T^*$ are
precisely the numbers $\pm 2/(2n-1)\pi$ $(n = 1, 2, \ldots)$, each having
multiplicity 1.

Proof. Since $\operatorname{Im} T$ is compact, it follows from Theorem 1.8.8
that $T$ is compact. The closed subspace $M = \mathcal{L}(f, Tf, T^2f, \ldots)$ of $\mathcal{H}$
is invariant under $T$, so $M^\perp$ is invariant under $T^*$. If $x \in M^\perp$ then

$$Tx = T^*x+i\langle x, f\rangle f = T^*x \in M^\perp; \tag{11}$$

hence $M^\perp$ is invariant under both $T$ and $T^*$, and therefore the same
is true of $M$. If $x, y \in M^\perp$ then, by (11),

$$\langle Tx, y\rangle = \langle x, T^*y\rangle = \langle x, Ty\rangle.$$

It follows that the quasi-nilpotent operator obtained by restricting $T$
to $M^\perp$ is self-adjoint, and is therefore zero. We have now proved
that

$$T(M) \subseteq M, \quad T^*(M) \subseteq M, \quad T(M^\perp) = T^*(M^\perp) = \{0\}. \tag{12}$$

Let $T_0$ be the compact linear operator obtained by restricting $T$
to $M$. It follows easily from (12) that $T_0^*$ is the restriction of $T^*$ to
$M$, and that the non-zero eigenvalues of $T_0+T_0^*$ are the same as
those of $T+T^*$, with the same multiplicities. However, $T_0$ satisfies
the conditions of Theorem 4.5.3, and is therefore unitarily equivalent
to the operator $V$ defined by equation (1). It now suffices to prove
that the non-zero eigenvalues of $V+V^*$ are precisely the numbers
$\pm 2/(2n-1)\pi$ $(n = 1, 2, \ldots)$, each with multiplicity 1.

For $x$ in $L_2(0,1)$,

$$\begin{aligned}
(Vx+V^*x)(s) &= i\int_0^s x(t)\,dt-i\int_s^1 x(t)\,dt \\
&= 2i\int_0^s x(t)\,dt-i\int_0^1 x(t)\,dt \\
&= 2i\int_0^s x(t)\,dt-i\langle x, e\rangle,
\end{aligned}$$

where $e$ is defined by equation (3). If $\lambda \ne 0$ and $(V+V^*)x = \lambda x$, then

$$2i\int_0^s x(t)\,dt-i\langle x, e\rangle = \lambda x(s)$$

for almost all $s$ in $[0,1]$. Upon multiplying the last equation by
$\lambda^{-1}\exp(-2is/\lambda)$, we deduce that the absolutely continuous function

$$\exp(-2is/\lambda)\Bigl[\int_0^s x(t)\,dt-\tfrac12\langle x, e\rangle\Bigr]$$

has derivative zero almost everywhere on $[0,1]$, and is therefore a
constant $k$. Thus

$$\int_0^s x(t)\,dt = \tfrac12\langle x, e\rangle+k\exp(2is/\lambda),$$

and differentiation gives $x(s) = c\exp(2is/\lambda)$ for almost all $s$ in
$[0,1]$, where $c = 2ik/\lambda$. It follows that, if $\lambda$ is a non-zero eigenvalue
of $V+V^*$, then every associated eigenvector is a scalar multiple of
the vector $e_\lambda$ defined by $e_\lambda(s) = \exp(2is/\lambda)$; so $\lambda$ has multiplicity 1.

Computation shows that, for any non-zero scalar $\lambda$,

$$(V+V^*)e_\lambda = \lambda e_\lambda-\tfrac12\lambda[1+\exp(2i/\lambda)]e.$$

Hence $\lambda$ is an eigenvalue of $V+V^*$ if and only if $\exp(2i/\lambda) = -1$; that
is, $\lambda = \pm 2/(2n-1)\pi$ for some positive integer $n$.
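The eigenvalue computation for $V+V^*$ can be checked numerically. The sketch below is only an illustration under stated assumptions: it takes $(V+V^*)x(s) = 2i\int_0^s x(t)\,dt - i\int_0^1 x(t)\,dt$, approximates the integrals by the composite Simpson rule (a choice made here, not in the text), and evaluates the residual of the eigenvalue equation for $e_\lambda(s) = \exp(2is/\lambda)$ at a few sample points; the helper names `simpson` and `residual` are hypothetical.

```python
import cmath
import math

def simpson(f, a, b, panels=200):
    """Composite Simpson rule for a complex-valued function on [a, b]."""
    if b == a:
        return 0j
    h = (b - a) / panels                 # panels must be even
    total = f(a) + f(b)
    for k in range(1, panels):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def residual(lam, s):
    """|((V + V*) e_lam)(s) - lam * e_lam(s)| for e_lam(t) = exp(2it/lam)."""
    def e_lam(t):
        return cmath.exp(2j * t / lam)
    value = 2j * simpson(e_lam, 0.0, s) - 1j * simpson(e_lam, 0.0, 1.0)
    return abs(value - lam * e_lam(s))

# Candidate eigenvalues +-2/((2n-1)*pi) for n = 1, 2, 3.
candidates = [sign * 2 / ((2 * n - 1) * math.pi)
              for n in (1, 2, 3) for sign in (1, -1)]
samples = (0.0, 0.3, 0.7, 1.0)

worst = max(residual(lam, s) for lam in candidates for s in samples)
not_eigen = residual(1.0, 0.3)           # lam = 1 is not of the required form

print(worst < 1e-5, not_eigen > 0.5)
```

For a value of $\lambda$ that is not of the form $\pm 2/(2n-1)\pi$ the residual is $\tfrac12|\lambda|\,|1+\exp(2i/\lambda)|$, which is why the test value $\lambda = 1$ leaves a residual of roughly $0.54$.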
LEMMA 4.5.6. Suppose that $\mathcal{F}$ is a continuous chain of closed
subspaces of a Hilbert space $\mathcal{H}$, $K$ is a self-adjoint operator of
finite rank $m$, the non-zero eigenvalues of $K$ (arranged in order of
decreasing magnitude and counted according to their
multiplicities) are $\lambda_1, \ldots, \lambda_m$, and $(a_n)$ is the real sequence defined
by $a_{2n-1} = a_{2n} = 2/(2n-1)\pi$ $(n = 1, 2, \ldots)$. Then the superdiagonal
integral

$$T = 2i\int_{\mathcal{F}} E(M)KE(dM)$$

converges, and its self-adjoint part satisfies

$$\|\operatorname{Re} T\| \le 2\sum_{j=1}^{m} a_j|\lambda_j|.$$

Proof. Since $M = M_-$ for each $M$ in $\mathcal{F}$, it follows from Theorem
4.4.10 that the superdiagonal integral converges and that $T$ is a
Schmidt class operator; by Theorem 4.4.6, $\operatorname{Im} T = K$, and $T$ leaves
invariant each subspace in $\mathcal{F}$. Thus $T = H+iK$, where $H$ is the
self-adjoint Schmidt class operator $\tfrac12(T+T^*)$. By Theorem 1.9.2 and
equation 1.9(3), there is a unit vector $f$ in $\mathcal{H}$ such that

$$\|H\| = |\langle Hf, f\rangle|. \tag{13}$$

If $B$ is the operator defined by

$$Bx = \langle x, f\rangle f \quad (x \in \mathcal{H}), \tag{14}$$

the above reasoning can be applied with $B$ in place of $K$; in this
way we obtain Schmidt class operators $A$, $S$ such that

$$A = A^*, \qquad S = A+iB = 2i\int_{\mathcal{F}} E(M)BE(dM). \tag{15}$$

Since the Schmidt class operators $S$ and $T$ leave invariant each
subspace in $\mathcal{F}$, the same is true of the trace class operator $TS$.
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Proof. The proof is divided into a number of stages. 
(a) We consider first the case in which F iS a continuous chain. 


There is an orthonormal system iO in H such that 


(22) Kx = i A <x, >Y, Et, 

and we define approximating operators K,, K,, ... of finite rank by 
n 

(23) K x = . 2 A <x ODO, x e Í). 


By Theorems 4.4.10 and 4.4.6, the superdiagonal integral 
(24) T = 2i Jeg E(M)K_ E(dM) 


converges, and T, isa Schmidt class operator which leaves 
invariant each subspace in F and has skew-adjoint part K x If 
m > n > 1, then 

m 


(25) KaK = E AG eH 


=n 


(so the eigenvalues of K a Kn are AnI? peed A and 


m 


T -T =2i Jg E(M) (K -K )E (dM). | 


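By Lemma 4.5.6,

$$\|\mathrm{Re}(T_m - T_n)\| \le \sum_{j=1}^{m-n} a_j |\lambda_{n+j}|,$$

where $a_{2r-1} = a_{2r} = 2/\big((2r-1)\pi\big)$. Furthermore, by (25),

$$\|\mathrm{Im}(T_m - T_n)\| = \|K_m - K_n\| = |\lambda_{n+1}|.$$

Hence

(26)   $\|T_m - T_n\| \le |\lambda_{n+1}| + \sum_{j=1}^{m-n} a_j |\lambda_{n+j}|$.

It follows from (21) that the series

(27)   $\sum_{j=1}^{\infty} a_j |\lambda_j|$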


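converges. If $N$ is any positive integer then, since both the
sequences $(a_j)$ and $(|\lambda_j|)$ are decreasing,

$$\begin{aligned}
\|T_m - T_n\| &\le |\lambda_{n+1}| + \sum_{j=1}^{m-n} a_j |\lambda_{n+j}| \\
  &\le |\lambda_{n+1}| + \sum_{j=1}^{N} a_j |\lambda_{n+j}| + \sum_{j=N+1}^{\infty} a_j |\lambda_{n+j}| \\
  &\le |\lambda_{n+1}| \Big(1 + \sum_{j=1}^{N} a_j\Big) + \sum_{j=N+1}^{\infty} a_j |\lambda_j|.
\end{aligned}$$

Given a positive real number $\varepsilon$, we can choose $N$ such that

$$\sum_{j=N+1}^{\infty} a_j |\lambda_j| < \tfrac12\varepsilon,$$

since the series (27) converges; then, since $\lambda_n \to 0$ as $n \to \infty$, we
can choose an integer $N_0$ such that

$$|\lambda_{n+1}| \Big(1 + \sum_{j=1}^{N} a_j\Big) < \tfrac12\varepsilon \quad \text{whenever } n \ge N_0.$$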


It follows that $\|T_m - T_n\| < \varepsilon$ whenever $m > n \ge N_0$. Thus $(T_n)$ is
a Cauchy sequence in $\mathcal{B}(\mathcal{H})$, and so converges in norm to an operator
$T$ in $\mathcal{B}(\mathcal{H})$. Since $T_n$ is compact and leaves invariant each subspace
in $\mathcal{F}$, the same is true of $T$, so $T$ is quasi-nilpotent by Corollary
4.3.11. Furthermore

$$\mathrm{Im}\, T = \lim_{n \to \infty} \mathrm{Im}(T_n) = \lim_{n \to \infty} K_n = K.$$

By Theorem 4.4.6, the superdiagonal integral of $K$ relative to $\mathcal{F}$
converges, and

$$2i \int_{\mathcal{F}} E(M)KE(dM) = T.$$


In view of (23) and (24), it follows from Lemma 4.5.6 that

$$\|\mathrm{Re}\, T_n\| \le \sum_{j=1}^{n} a_j |\lambda_j| \le \sum_{j=1}^{\infty} a_j |\lambda_j|
  = \frac{2}{\pi} \sum_{j=1}^{\infty} \frac{|\lambda_{2j-1}| + |\lambda_{2j}|}{2j-1};$$

we obtain the same inequality for $T$ by taking limits as $n \to \infty$.
This completes the proof of the theorem when $\mathcal{F}$ is a continuous
chain.
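
The series (27) can be regrouped in pairs, which is how the constants $a_j$ produce the factor $2/\pi$ in the bound for the self-adjoint part. The following is a worked check, using only the definition $a_{2r-1} = a_{2r} = 2/\big((2r-1)\pi\big)$; the index name $r$ is chosen here for illustration, and the regrouping is legitimate because all terms are non-negative.

```latex
\begin{align*}
\sum_{j=1}^{\infty} a_j |\lambda_j|
  &= \sum_{r=1}^{\infty} \bigl( a_{2r-1} |\lambda_{2r-1}| + a_{2r} |\lambda_{2r}| \bigr) \\
  &= \sum_{r=1}^{\infty} \frac{2}{(2r-1)\pi} \bigl( |\lambda_{2r-1}| + |\lambda_{2r}| \bigr) \\
  &= \frac{2}{\pi} \sum_{r=1}^{\infty} \frac{|\lambda_{2r-1}| + |\lambda_{2r}|}{2r-1}.
\end{align*}
```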

(b) We now consider the case in which $\mathcal{F}$ is a simple (but not
necessarily continuous) chain. Our first step is to embed $\mathcal{H}$ in a
larger Hilbert space $\mathcal{H}_0$ and to construct from $\mathcal{F}$ a continuous chain
$\mathcal{F}_0$ of closed subspaces of $\mathcal{H}_0$. For this purpose we assign, to each
subspace $M$ in $\mathcal{F}$ such that $M \ne M_-$, a separable infinite-dimensional
Hilbert space $\mathcal{H}_M$, and we form the Hilbert direct sums

(28)   $\mathcal{H}_1 = \sum^{\oplus} \{\mathcal{H}_M : M \in \mathcal{F},\ M \ne M_-\}, \quad \mathcal{H}_0 = \mathcal{H} \oplus \mathcal{H}_1$.

If $M \in \mathcal{F}$ and $M \ne M_-$, then $P(M)(\mathcal{H})$ is a 1-dimensional subspace
of $\mathcal{H}$, so

(29)   $\mathcal{K}_M = P(M)(\mathcal{H}) \oplus \mathcal{H}_M$

is a separable infinite-dimensional Hilbert space, and can be
embedded canonically as a subspace of $\mathcal{H}_0$ ($= \mathcal{H} \oplus \mathcal{H}_1$). It follows
from the discussion at the end of §4.3.12 that there is a continuous
chain $\mathcal{F}_M$ of closed subspaces of $\mathcal{K}_M$. When $M \in \mathcal{F}$ and $M = M_-$, we
define $\mathcal{H}_M = \mathcal{K}_M = \{0\}$, and denote by $\mathcal{F}_M$ the trivial chain consisting
of $\mathcal{K}_M$ only. We shall identify each of the Hilbert spaces $\mathcal{H}$, $\mathcal{H}_1$,
$\mathcal{H}_M$, $\mathcal{K}_M$ with its canonical image in $\mathcal{H}_0$.

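If $M \in \mathcal{F}$, the subspaces $M_-$ and $P(M)(\mathcal{H})$ of $\mathcal{H}$ are mutually
orthogonal, since $P(M) = E(M) - E(M_-)$. From this, together with (28)
and (29), it follows that the subspaces

$$M_-, \quad \mathcal{K}_M, \quad \mathcal{H}_L \quad (L \in \mathcal{F},\ L \subseteq M_-)$$

are pairwise orthogonal. Given any $M$ in $\mathcal{F}$ and $N$ in $\mathcal{F}_M$, $N \subseteq \mathcal{K}_M$
and we can form the subspace

$$X(M, N) = M_- \oplus N \oplus \sum^{\oplus} \{\mathcal{H}_L : L \in \mathcal{F},\ L \subseteq M_-\}$$

of $\mathcal{H}_0$. As $N$ increases continuously from $\{0\}$ to $\mathcal{K}_M$, $X(M, N)$
increases continuously from

$$M_- \oplus \sum^{\oplus} \{\mathcal{H}_L : L \in \mathcal{F},\ L \subseteq M_-\}$$

to

$$M \oplus \sum^{\oplus} \{\mathcal{H}_L : L \in \mathcal{F},\ L \subseteq M\}.$$

With this in mind, it is not difficult to show that

$$\mathcal{F}_0 = \{X(M, N) : M \in \mathcal{F},\ N \in \mathcal{F}_M\}$$

is a continuous chain of closed subspaces of $\mathcal{H}_0$; the details of this
verification are left as an exercise for the reader.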


(c) Let $K_0$ be the compact self-adjoint operator on $\mathcal{H}_0$ defined by

(30)   $K_0(x + x_1) = Kx \quad (x \in \mathcal{H},\ x_1 \in \mathcal{H}_1)$.

Since $K_0$ has the same non-zero eigenvalues as $K$, and $\mathcal{F}_0$ is a
continuous chain, it follows from part (a) of this proof that the
superdiagonal integral

$$T_0 = 2i \int_{\mathcal{F}_0} E(M)K_0E(dM)$$

converges; furthermore

(31)   $\|\tfrac12(T_0 + T_0^*)\| \le \frac{2}{\pi} \sum_{n=1}^{\infty} \frac{|\lambda_{2n-1}| + |\lambda_{2n}|}{2n-1}$.

Of course, $T_0$ is a compact quasi-nilpotent operator with skew-adjoint
part $K_0$, and leaves invariant each subspace in $\mathcal{F}_0$.


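(d) We shall prove that

(32)   $T_0(\mathcal{H}_1) = \{0\}$.

For this purpose, suppose that $M \in \mathcal{F}$ and $M \ne M_-$. Then the
subspaces

$$X_1 = X(M, \{0\}) = M_- \oplus \sum^{\oplus} \{\mathcal{H}_L : L \in \mathcal{F},\ L \subseteq M_-\},$$
$$X_2 = X(M, \mathcal{K}_M) = M \oplus \sum^{\oplus} \{\mathcal{H}_L : L \in \mathcal{F},\ L \subseteq M\}$$

of $\mathcal{H}_0$ belong to the chain $\mathcal{F}_0$, and

(33)   $X_1 \subseteq X_2, \quad X_2 \cap X_1^{\perp} = \mathcal{K}_M = P(M)(\mathcal{H}) \oplus \mathcal{H}_M$.

Let $E_j$ be the projection from $\mathcal{H}_0$ onto $X_j$, and consider the operator
$(E_2 - E_1)K_0(E_2 - E_1)$. Since

(34)   $(E_2 - E_1)(\mathcal{H}_0) = X_2 \cap X_1^{\perp} = P(M)(\mathcal{H}) \oplus \mathcal{H}_M$,

and $\mathcal{H}_M \subseteq \mathcal{H}_1$, it follows from (30) that

$$K_0(E_2 - E_1)(x + x_1) = KP(M)x \quad (x \in \mathcal{H},\ x_1 \in \mathcal{H}_1).$$

Since the right-hand side of this last equation is in $\mathcal{H}$ and so
orthogonal to $\mathcal{H}_M$, (34) now implies that

$$(E_2 - E_1)K_0(E_2 - E_1)(x + x_1) = P(M)KP(M)x = 0.$$

Thus $(E_2 - E_1)K_0(E_2 - E_1) = 0$, and

$$\mathrm{Im}\,(E_2 - E_1)T_0(E_2 - E_1) = (E_2 - E_1)(\mathrm{Im}\, T_0)(E_2 - E_1)
  = (E_2 - E_1)K_0(E_2 - E_1) = 0;$$

so the operator $(E_2 - E_1)T_0(E_2 - E_1)$ is self-adjoint.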

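Given any bounded linear operators $A_1$ and $A_2$ on $\mathcal{H}_0$, which
leave invariant the subspaces $X_1$ and $X_2$, we have $A_iE_j = E_jA_iE_j$
(equivalently, $(I - E_j)A_i = (I - E_j)A_i(I - E_j)$) for $i, j = 1, 2$; thus

$$\begin{aligned}
(E_2 - E_1)A_1A_2(E_2 - E_1)
  &= (E_2 - E_1)(I - E_1)A_1A_2E_2(E_2 - E_1) \\
  &= (E_2 - E_1)(I - E_1)A_1(I - E_1)E_2A_2E_2(E_2 - E_1) \\
  &= (E_2 - E_1)A_1(E_2 - E_1)A_2(E_2 - E_1).
\end{aligned}$$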
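
The equalities above rest on elementary identities for the two projections, which hold because $X_1 \subseteq X_2$ (so that $E_1E_2 = E_2E_1 = E_1$). A short worked check, offered as a sketch of the bookkeeping rather than as part of the original argument:

```latex
\begin{align*}
(E_2 - E_1)(I - E_1) &= E_2 - E_2E_1 - E_1 + E_1^2 = E_2 - E_1, \\
E_2(E_2 - E_1)       &= E_2^2 - E_2E_1 = E_2 - E_1, \\
(I - E_1)E_2         &= E_2 - E_1E_2 = E_2 - E_1.
\end{align*}
```

The first two identities justify inserting $(I - E_1)$ and $E_2$ next to the outer factors; the third collapses the middle product $(I - E_1)E_2$ once the invariance relations have been applied.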
By applying this with $A_1$ and $A_2$ powers of $T_0$ we prove inductively
that

$$[(E_2 - E_1)T_0(E_2 - E_1)]^n = (E_2 - E_1)T_0^n(E_2 - E_1) \quad (n = 1, 2, \ldots).$$

Thus

$$\|[(E_2 - E_1)T_0(E_2 - E_1)]^n\| \le \|T_0^n\|,$$

and so $(E_2 - E_1)T_0(E_2 - E_1)$ is quasi-nilpotent; since it is also
self-adjoint,

(35)   $(E_2 - E_1)T_0(E_2 - E_1) = 0$.
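
The passage from "quasi-nilpotent and self-adjoint" to (35) uses the standard fact that the norm of a self-adjoint operator equals its spectral radius. A brief sketch, writing $S$ (a name introduced here only for illustration) for the operator $(E_2 - E_1)T_0(E_2 - E_1)$:

```latex
% S = S^* is bounded, and \|S^*S\| = \|S\|^2 gives \|S^2\| = \|S\|^2.
\begin{align*}
\|S^{2^k}\| &= \|S\|^{2^k} \quad (k = 0, 1, 2, \ldots)
  && \text{(induction, since each } S^{2^k} \text{ is self-adjoint)}, \\
\|S\| &= \lim_{k \to \infty} \|S^{2^k}\|^{2^{-k}} = 0
  && \text{(quasi-nilpotence: } \|S^n\|^{1/n} \to 0\text{)}.
\end{align*}
```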


If $x \in \mathcal{H}_M$ ($\subseteq \mathcal{H}_1$) then, by (30),

$$(\mathrm{Im}\, T_0)x = K_0x = 0,$$

so $T_0x = T_0^*x$; furthermore,

$$x \in X_2 \cap X_1^{\perp} = (E_2 - E_1)(\mathcal{H}_0).$$

Now $T_0$ leaves $X_1$ and $X_2$ invariant, thus $T_0^*$ leaves $X_1^{\perp}$ invariant,
and so

$$T_0x \in X_2, \quad T_0x = T_0^*x \in X_1^{\perp}.$$

Thus $T_0x \in X_2 \cap X_1^{\perp}$, and so

$$T_0x = (E_2 - E_1)T_0(E_2 - E_1)x = 0$$

by (35). This shows that $T_0(\mathcal{H}_M) = \{0\}$ whenever $M \in \mathcal{F}$ and
$M \ne M_-$, and it follows from (28) that $T_0(\mathcal{H}_1) = \{0\}$, proving (32).
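(e) By (30) and (32),

$$T_0(\mathcal{H}_1) = \{0\}, \quad (\mathrm{Im}\, T_0)(\mathcal{H}_1) = K_0(\mathcal{H}_1) = \{0\},$$

so $T_0^*(\mathcal{H}_1) = \{0\}$. Since $\mathcal{H}$ is the orthogonal complement of $\mathcal{H}_1$ in $\mathcal{H}_0$,
it follows that $T_0(\mathcal{H}) \subseteq \mathcal{H}$. This, with (32), shows that there is a
bounded linear operator $T$ on $\mathcal{H}$ such that

$$T_0(x + x_1) = Tx \quad (x \in \mathcal{H},\ x_1 \in \mathcal{H}_1),$$

whence also

$$T_0^*(x + x_1) = T^*x \quad (x \in \mathcal{H},\ x_1 \in \mathcal{H}_1).$$

It is clear that $T$ is quasi-nilpotent, $\mathrm{Im}\, T = K$ and, by (31),

$$\|\tfrac12(T + T^*)\| = \|\tfrac12(T_0 + T_0^*)\| \le \frac{2}{\pi} \sum_{n=1}^{\infty} \frac{|\lambda_{2n-1}| + |\lambda_{2n}|}{2n-1}.$$

Given $M$ in $\mathcal{F}$, $M = X(M, \mathcal{K}_M) \cap \mathcal{H}$ (as a subspace of $\mathcal{H}_0$); since
$X(M, \mathcal{K}_M)$ and $\mathcal{H}$ are both invariant under $T_0$ so is $M$, and thus $M$ (as
a subspace of $\mathcal{H}$) is invariant under $T$. It now follows from Theorem
4.4.6 that the superdiagonal integral

$$2i \int_{\mathcal{F}} E(M)KE(dM)$$

converges, and that its value is $T$. This completes the proof of
Theorem 4.5.7.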


The hypotheses of Theorem 4.5.7 are satisfied if $K$ is a
self-adjoint element of a von Neumann-Schatten class $\mathcal{C}_p$, where
$1 < p < \infty$, and $P(M)KP(M) = 0$ whenever $M \in \mathcal{F}$, $M \ne M_-$; for the
index $q$ conjugate to $p$ satisfies $q > 1$, and

$$\sum_{n=1}^{\infty} \frac{|\lambda_n|}{n} \le \Big(\sum_{n=1}^{\infty} n^{-q}\Big)^{1/q} \Big(\sum_{n=1}^{\infty} |\lambda_n|^p\Big)^{1/p} < \infty.$$

A good deal more is known [45, 20]; if $K$ satisfies the above
conditions and

$$T = 2i \int_{\mathcal{F}} E(M)KE(dM)$$

then $T \in \mathcal{C}_p$; furthermore, the mapping $K \to T$ is bounded relative to
the norm on $\mathcal{C}_p$. These results fail when $p = 1$ (consider the operator
$V$ defined by (1)).


4.6. An alternative treatment of the integration operator in $L_2(0,1)$

The proofs given in §4.5 for Theorems 4.5.3 and 4.5.4 are
heavily dependent on the theory of superdiagonal representation of
compact linear operators; in this section, we give a more elementary
treatment. Once again, we assume throughout that $T$ is a bounded
linear operator acting on a Hilbert space $\mathcal{H}$, and

(a) $T$ is quasi-nilpotent,

(b) there is a unit vector $f$ in $\mathcal{H}$ such that
      $(T - T^*)x = i \langle x, f\rangle f \quad (x \in \mathcal{H})$,

(c) $\mathcal{L}(f, Tf, T^2f, \ldots) = \mathcal{H}$.

Condition (b) may be expressed in the form

(1)   $T - T^* = iK$,

where $K$ is the self-adjoint operator given by

(2)   $Kx = \langle x, f\rangle f \quad (x \in \mathcal{H})$.

We shall consider the entire function defined by

(3)   $\phi(\lambda) = 1 - i\lambda \langle (I + \lambda T)^{-1}f, f\rangle$.


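LEMMA 4.6.1. For all complex numbers $\lambda$ and $\mu$,

$$\phi(\lambda)\overline{\phi(\mu)} = 1 - i(\lambda - \bar\mu) \langle (I + \lambda T)^{-1}f, (I + \mu T)^{-1}f\rangle.$$

Proof. Since

$$\begin{aligned}
(\lambda - \bar\mu)&(I + \bar\mu T^*)^{-1}(I + \lambda T)^{-1} \\
  &= (I + \bar\mu T^*)^{-1}[\lambda(I + \bar\mu T^*) - \bar\mu(I + \lambda T) + \lambda\bar\mu(T - T^*)](I + \lambda T)^{-1} \\
  &= \lambda(I + \lambda T)^{-1} - \bar\mu(I + \bar\mu T^*)^{-1} + i\lambda\bar\mu(I + \bar\mu T^*)^{-1}K(I + \lambda T)^{-1},
\end{aligned}$$

it follows that

$$\begin{aligned}
(\lambda - \bar\mu)&\langle (I + \lambda T)^{-1}f, (I + \mu T)^{-1}f\rangle \\
  &= (\lambda - \bar\mu) \langle (I + \bar\mu T^*)^{-1}(I + \lambda T)^{-1}f, f\rangle \\
  &= \lambda \langle (I + \lambda T)^{-1}f, f\rangle - \bar\mu \langle (I + \bar\mu T^*)^{-1}f, f\rangle
     + i\lambda\bar\mu \langle (I + \bar\mu T^*)^{-1}K(I + \lambda T)^{-1}f, f\rangle.
\end{aligned}$$

Now

$$K(I + \lambda T)^{-1}f = \langle (I + \lambda T)^{-1}f, f\rangle f,$$

so

$$\langle (I + \bar\mu T^*)^{-1}K(I + \lambda T)^{-1}f, f\rangle = \langle (I + \lambda T)^{-1}f, f\rangle \langle (I + \bar\mu T^*)^{-1}f, f\rangle.$$

Thus

$$\begin{aligned}
(\lambda - \bar\mu)&\langle (I + \lambda T)^{-1}f, (I + \mu T)^{-1}f\rangle \\
  &= \lambda \langle (I + \lambda T)^{-1}f, f\rangle - \bar\mu \langle (I + \bar\mu T^*)^{-1}f, f\rangle
     + i\lambda\bar\mu \langle (I + \lambda T)^{-1}f, f\rangle \langle (I + \bar\mu T^*)^{-1}f, f\rangle,
\end{aligned}$$

and

$$\begin{aligned}
1 - i(\lambda - \bar\mu)&\langle (I + \lambda T)^{-1}f, (I + \mu T)^{-1}f\rangle \\
  &= 1 - i\lambda \langle (I + \lambda T)^{-1}f, f\rangle + i\bar\mu \langle (I + \bar\mu T^*)^{-1}f, f\rangle
     + \lambda\bar\mu \langle (I + \lambda T)^{-1}f, f\rangle \langle (I + \bar\mu T^*)^{-1}f, f\rangle \\
  &= [1 - i\lambda \langle (I + \lambda T)^{-1}f, f\rangle][1 + i\bar\mu \langle (I + \bar\mu T^*)^{-1}f, f\rangle] \\
  &= \phi(\lambda)\overline{\phi(\mu)}.
\end{aligned}$$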


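LEMMA 4.6.2. For all complex numbers $\lambda$, $\phi(\lambda) = \exp(-i\lambda)$.

Proof. By Lemma 4.6.1,

$$|\phi(\lambda)|^2 = 1 + 2\,\mathrm{Im}\,\lambda\, \|(I + \lambda T)^{-1}f\|^2,$$

and

$$\phi(\lambda)\overline{\phi(\mu)} = 1$$

if $\mu = \bar\lambda$. Furthermore, (3) can be written in the form

(4)   $\phi(\lambda) = 1 + \sum_{n=1}^{\infty} (-\lambda)^n\, i \langle T^{n-1}f, f\rangle$.

It follows that $\phi$ is an entire function which does not take the value
zero, $\phi(0) = 1$, $|\phi(\lambda)| = 1$ when $\lambda$ is real and $|\phi(\lambda)| > 1$ when
$\mathrm{Im}\,\lambda > 0$. By Theorem 1.4.3 there is a positive real number $a$ such
that $\phi(\lambda) = \exp(-ia\lambda)$ $(\lambda \in \mathbb{C})$. Since the coefficient of $\lambda$ on the
right-hand side of (4) is $-i\langle f, f\rangle = -i$, it follows that $a = 1$.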
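
For the integration operator the conclusion of Lemma 4.6.2 can be checked by hand. The sketch below assumes the normalisation $(Vx)(s) = i\int_0^s x(t)\,dt$ with $e(s) = 1$; definitions 4.5(1) and 4.5(3) are not reproduced in this section, so this form is an assumption, chosen because it is consistent with condition (b) and with the formula for $(V^*)^n e$ used later in the section.

```latex
% Assumption: (Vx)(s) = i \int_0^s x(t)\,dt on L_2(0,1), and e(s) = 1.
\begin{align*}
y = (I + \lambda V)^{-1} e
  &\iff y(s) + i\lambda \int_0^s y(t)\,dt = 1 \quad (0 \le s \le 1) \\
  &\iff y' = -i\lambda y, \quad y(0) = 1 \\
  &\iff y(s) = \exp(-i\lambda s), \\
\langle (I + \lambda V)^{-1} e, e \rangle
  &= \int_0^1 \exp(-i\lambda s)\,ds = \frac{1 - \exp(-i\lambda)}{i\lambda} \quad (\lambda \ne 0), \\
1 - i\lambda \langle (I + \lambda V)^{-1} e, e \rangle
  &= 1 - \bigl(1 - \exp(-i\lambda)\bigr) = \exp(-i\lambda).
\end{align*}
```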


Alternative proof of Theorem 4.5.3. We have seen that, if the
unit vector $e$ in $L_2(0,1)$ and the integration operator $V$ are defined
by equations 4.5(3) and 4.5(1), then $V$ satisfies conditions (a), (b)
and (c) above, with $f$ replaced by $e$. Suppose that $T$ is any other
operator which satisfies these conditions. It follows from Lemmas
4.6.1 and 4.6.2 that

$$1 - i(\lambda - \bar\mu) \langle (I + \lambda T)^{-1}f, (I + \mu T)^{-1}f\rangle
  = \exp i(\bar\mu - \lambda)
  = 1 - i(\lambda - \bar\mu) \langle (I + \lambda V)^{-1}e, (I + \mu V)^{-1}e\rangle,$$

whence

$$\langle (I + \lambda T)^{-1}f, (I + \mu T)^{-1}f\rangle = \langle (I + \lambda V)^{-1}e, (I + \mu V)^{-1}e\rangle,$$

for all complex $\lambda$ and $\mu$. By expanding both sides of this last
equation in powers of $\lambda$ and $\bar\mu$, and comparing coefficients, we obtain

$$\langle T^m f, T^n f\rangle = \langle V^m e, V^n e\rangle \quad (m, n = 0, 1, 2, \ldots).$$
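
The coefficient comparison can be made explicit. Since $T$ is quasi-nilpotent, the Neumann series for $(I + \lambda T)^{-1}$ converges in norm for every complex $\lambda$; the following sketch spells out the resulting double series (the same expansion holds with $V$, $e$ in place of $T$, $f$).

```latex
\begin{align*}
(I + \lambda T)^{-1} &= \sum_{m=0}^{\infty} (-\lambda)^m T^m, \\
\langle (I + \lambda T)^{-1} f, (I + \mu T)^{-1} f \rangle
  &= \sum_{m=0}^{\infty} \sum_{n=0}^{\infty}
     (-\lambda)^m \, \overline{(-\mu)^n} \, \langle T^m f, T^n f \rangle \\
  &= \sum_{m=0}^{\infty} \sum_{n=0}^{\infty}
     (-1)^{m+n} \lambda^m \bar\mu^{\,n} \langle T^m f, T^n f \rangle.
\end{align*}
```

Equating the coefficients of $\lambda^m \bar\mu^{\,n}$ in the two expansions gives the stated equality of inner products.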


By linearity of the inner products, if $p$ and $q$ are polynomials with
complex coefficients, then

$$\langle p(T)f, q(T)f\rangle = \langle p(V)e, q(V)e\rangle;$$

in particular

(5)   $\|p(T)f\| = \|p(V)e\|$.

Since $T$ and $V$ both satisfy condition (c), the set of all vectors
$p(V)e$ is an everywhere dense subspace $X$ of $L_2(0,1)$, and vectors of
the form $p(T)f$ are dense in the Hilbert space $\mathcal{H}$ on which $T$ acts. If
$x \in X$ and $x$ is represented in two ways, $x = p(V)e = q(V)e$, then

$$\|p(T)f - q(T)f\| = \|p(V)e - q(V)e\| = 0,$$

by (5), and so $p(T)f = q(T)f$. It follows that the equation

(6)   $Up(V)e = p(T)f$

defines (unambiguously) an isometric isomorphism from $X$ onto an
everywhere dense subspace of $\mathcal{H}$; this extends by continuity to an
isometric isomorphism, which we denote by the same symbol $U$, from
$L_2(0,1)$ onto $\mathcal{H}$. For $n = 0, 1, 2, \ldots$,

$$UVV^n e = UV^{n+1}e = T^{n+1}f = TT^n f = TUV^n e.$$

By linearity and continuity, $UVx = TUx$ for each $x$ in
$\mathcal{L}(e, Ve, V^2e, \ldots)$ ($= L_2(0,1)$); so $UV = TU$, $T = UVU^{-1}$. This
completes the alternative proof of Theorem 4.5.3.


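In the following lemmas, which lead up to an alternative proof of
Theorem 4.5.4, the unit vector $e$ in $L_2(0,1)$ and the integration
operator $V$ are defined as in §4.5, and $\mathcal{H}_0$ is a closed subspace of
$L_2(0,1)$ which is invariant under $V$. The restriction of $V$ to $\mathcal{H}_0$ is
denoted by $V_0$, and $P$ is the projection from $L_2(0,1)$ onto $\mathcal{H}_0$.

LEMMA 4.6.3. $\mathcal{H}_0 = \mathcal{L}(Pe, VPe, V^2Pe, \ldots)$.

Proof. If $\mathcal{H}_1 = \mathcal{L}(Pe, VPe, V^2Pe, \ldots)$, then $\mathcal{H}_1 \subseteq \mathcal{H}_0$; in order to
prove that $\mathcal{H}_1 = \mathcal{H}_0$ it is sufficient to show that $\mathcal{H}_0 \cap \mathcal{H}_1^{\perp} = \{0\}$.
Clearly $V(\mathcal{H}_1) \subseteq \mathcal{H}_1$, so

$$V^*(\mathcal{H}_1^{\perp}) \subseteq \mathcal{H}_1^{\perp}.$$

Suppose that $y \in \mathcal{H}_0 \cap \mathcal{H}_1^{\perp}$. Then

$$\langle y, e\rangle = \langle Py, e\rangle = \langle y, Pe\rangle = 0,$$

and $Vy - V^*y = i\langle y, e\rangle e = 0$. Thus $Vy = V^*y$,

$$Vy \in V(\mathcal{H}_0) \subseteq \mathcal{H}_0, \quad V^*y \in V^*(\mathcal{H}_1^{\perp}) \subseteq \mathcal{H}_1^{\perp},$$

and so $Vy = V^*y \in \mathcal{H}_0 \cap \mathcal{H}_1^{\perp}$. We have now proved that

(7)   $\langle y, e\rangle = 0, \quad V^*y = Vy \in \mathcal{H}_0 \cap \mathcal{H}_1^{\perp}$, whenever $y \in \mathcal{H}_0 \cap \mathcal{H}_1^{\perp}$.

Given any $x$ in $\mathcal{H}_0 \cap \mathcal{H}_1^{\perp}$, it follows from (7) that
$(V^*)^n x \in \mathcal{H}_0 \cap \mathcal{H}_1^{\perp}$, and hence that $\langle (V^*)^n x, e\rangle = 0$, for
$n = 0, 1, 2, \ldots$. Thus $\langle x, V^n e\rangle = 0$ $(n = 0, 1, 2, \ldots)$ and, since
$\mathcal{L}(e, Ve, V^2e, \ldots) = L_2(0,1)$, $x = 0$. This shows that $\mathcal{H}_0 \cap \mathcal{H}_1^{\perp} = \{0\}$,
as required.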
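LEMMA 4.6.4. For all complex numbers $\lambda$,

$$1 - i\lambda \langle (I + \lambda V)^{-1}Pe, e\rangle = \exp(-i\|Pe\|^2\lambda).$$

Proof. If $\mathcal{H}_0 = \{0\}$ then $P = 0$ and the result is apparent. We
suppose $\mathcal{H}_0 \ne \{0\}$, so that $Pe \ne 0$ by Lemma 4.6.3, and we define
$\sigma = \|Pe\|$ $(> 0)$. If $x, y \in \mathcal{H}_0$, then

$$\langle x, V_0^*y\rangle = \langle V_0x, y\rangle = \langle Vx, y\rangle = \langle x, V^*y\rangle
  = \langle Px, V^*y\rangle = \langle x, PV^*y\rangle.$$

Since $V_0^*y, PV^*y \in \mathcal{H}_0$, it follows that $V_0^*y = PV^*y$. Also,
$V_0y = Vy = PVy$, because $Vy \in \mathcal{H}_0$. Thus

$$\begin{aligned}
V_0y - V_0^*y &= P(Vy - V^*y) \\
  &= i\langle y, e\rangle Pe \\
  &= i\langle Py, e\rangle Pe \\
  &= i\langle y, Pe\rangle Pe \quad (y \in \mathcal{H}_0).
\end{aligned}$$

If $f$ denotes the unit vector $\sigma^{-1}Pe$ in $\mathcal{H}_0$ and $T = \sigma^{-2}V_0$ then
the last equation asserts that

$$Ty - T^*y = i\langle y, f\rangle f \quad (y \in \mathcal{H}_0).$$

Now $T$ is quasi-nilpotent, and it follows from Lemma 4.6.3 that
$\mathcal{L}(f, Tf, T^2f, \ldots) = \mathcal{H}_0$. Hence $T$ satisfies conditions (a), (b), (c)
above and, by Lemma 4.6.2,

$$1 - i\lambda \langle (I + \lambda T)^{-1}f, f\rangle = \exp(-i\lambda)$$

for all complex $\lambda$. Thus

$$\begin{aligned}
\exp(-i\|Pe\|^2\lambda) &= \exp(-i\sigma^2\lambda) \\
  &= 1 - i\sigma^2\lambda \langle (I + \sigma^2\lambda T)^{-1}f, f\rangle \\
  &= 1 - i\lambda \langle (I + \lambda V_0)^{-1}Pe, Pe\rangle \\
  &= 1 - i\lambda \langle P(I + \lambda V_0)^{-1}Pe, e\rangle.
\end{aligned}$$

Since $(I + \lambda V_0)^{-1}Pe \in \mathcal{H}_0$,

$$\begin{aligned}
P(I + \lambda V_0)^{-1}Pe &= (I + \lambda V_0)^{-1}Pe \\
  &= \sum_{n=0}^{\infty} (-\lambda V_0)^n Pe \\
  &= \sum_{n=0}^{\infty} (-\lambda V)^n Pe = (I + \lambda V)^{-1}Pe,
\end{aligned}$$

and so

$$\exp(-i\|Pe\|^2\lambda) = 1 - i\lambda \langle (I + \lambda V)^{-1}Pe, e\rangle.$$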
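
As a consistency check of Lemma 4.6.4, one can take for the invariant subspace the set of functions vanishing almost everywhere on $[0, a]$, with $0 \le a < 1$, so that $Pe = \chi_{[a,1]}$ and $\|Pe\|^2 = 1 - a$. The symbol $a$ and this particular subspace are chosen here for illustration, and the sketch again assumes the normalisation $(Vx)(s) = i\int_0^s x(t)\,dt$, $e(s) = 1$, which is not restated in this section.

```latex
% Assumptions: (Vx)(s) = i \int_0^s x(t)\,dt, e(s) = 1, and
% H_0 = \{x : x = 0 a.e. on [0,a]\}, so that Pe = \chi_{[a,1]}.
\begin{align*}
y = (I + \lambda V)^{-1} Pe
  &\iff y(s) + i\lambda \int_0^s y(t)\,dt = \chi_{[a,1]}(s) \\
  &\iff y(s) = 0 \ (0 \le s < a), \quad
        y(s) = \exp\bigl(-i\lambda(s - a)\bigr) \ (a \le s \le 1), \\
\langle (I + \lambda V)^{-1} Pe, e \rangle
  &= \int_a^1 \exp\bigl(-i\lambda(s - a)\bigr)\,ds
   = \frac{1 - \exp\bigl(-i\lambda(1 - a)\bigr)}{i\lambda} \quad (\lambda \ne 0), \\
1 - i\lambda \langle (I + \lambda V)^{-1} Pe, e \rangle
  &= \exp\bigl(-i(1 - a)\lambda\bigr) = \exp\bigl(-i\|Pe\|^2\lambda\bigr).
\end{align*}
```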


LEMMA 4.6.5. Suppose that $P_j$ is the projection from $L_2(0,1)$
onto a closed subspace $\mathcal{H}_j$ which is invariant under $V$ $(j = 1, 2)$. If
$\|P_1e\| = \|P_2e\|$, then $\mathcal{H}_1 = \mathcal{H}_2$.

Proof. If $\|P_1e\| = \|P_2e\| = \sigma$ then, by Lemma 4.6.4,

$$1 - i\lambda \langle (I + \lambda V)^{-1}P_1e, e\rangle = 1 - i\lambda \langle (I + \lambda V)^{-1}P_2e, e\rangle \quad (= \exp(-i\sigma^2\lambda))$$

for all complex $\lambda$. By expanding both sides of this equation in
powers of $\lambda$ and comparing coefficients, we obtain
$\langle V^nP_1e, e\rangle = \langle V^nP_2e, e\rangle$ $(n = 0, 1, 2, \ldots)$. Thus

(8)   $\langle P_1e - P_2e, (V^*)^n e\rangle = 0 \quad (n = 0, 1, 2, \ldots)$.

A straightforward computation based on equation 4.5(2) shows that

$$(V^*)^n e = (n!)^{-1}(-i)^n e_n,$$

where the vector $e_n$ in $L_2(0,1)$ is given by $e_n(s) = (1 - s)^n$. Thus
the set of finite linear combinations of vectors $e, V^*e, (V^*)^2e, \ldots$
consists of all polynomials, and is therefore dense in $L_2(0,1)$.
This, together with (8), shows that $P_1e = P_2e$. By Lemma 4.6.3,
$\mathcal{H}_j = \mathcal{L}(P_je, VP_je, V^2P_je, \ldots)$, so $\mathcal{H}_1 = \mathcal{H}_2$.

Alternative proof of Theorem 4.5.4. Suppose that $\mathcal{H}_1$ is a
closed subspace of $L_2(0,1)$ which is invariant under $V$, and that $P_1$
is the corresponding projection. Let $\sigma = \|P_1e\|$, so that $0 \le \sigma \le 1$,
and define a closed subspace $\mathcal{H}_2$ of $L_2(0,1)$ by

$$\mathcal{H}_2 = \{x \in L_2(0,1) : x(s) = 0 \text{ almost everywhere on } [0, 1 - \sigma^2]\}.$$

Thus $\mathcal{H}_2$ is invariant under $V$ and, in the notation of equation 4.5(6),
$\mathcal{H}_2 = L_\lambda$ with $\lambda = 1 - \sigma^2$. The projection $P_2$ from $L_2(0,1)$ onto $\mathcal{H}_2$ is
given by

$$(P_2x)(s) = \begin{cases} 0 & (0 \le s < 1 - \sigma^2), \\ x(s) & (1 - \sigma^2 \le s \le 1), \end{cases}$$

so

$$\|P_2x\|^2 = \int_{1-\sigma^2}^{1} |x(s)|^2\,ds \quad (x \in L_2(0,1)).$$

In particular, $\|P_2e\| = \sigma = \|P_1e\|$; by Lemma 4.6.5, $\mathcal{H}_1 = \mathcal{H}_2 = L_\lambda$,
where $\lambda = 1 - \sigma^2$.